In search of credit


Explicit recognition of researchers’ contributions to science is becoming more comprehensive. Not
before time, especially as a means of crediting referees.


Last year, this journal received an unusual request: could three
authors have it indicated in a footnote that they were joint second
authors on a paper? We refused; for better or worse, our
policy is to allow no more than three authors in first and last positions
on a paper. But authorship order is a much greater obsession in some
disciplines than others (the example in question was from biology).
And there could hardly be a more clumsy way of indicating credit,
not to mention the disputes that it provokes among co-authors.

For several years, Nature and the Nature research journals have
insisted that each author's contribution should be indicated in a statement
at the end of any paper. However, these statements are not systematic,
and are not accompanied by metadata to make them more
searchable. So although this approach works reasonably well in indicating
who did what on a particular paper, there is potential for such
statements to cumulatively provide a database of the skills and experience
of individual researchers. Through such statements, it could
become transparently clear that, say, John Smith was responsible for
the development of a particular technique and had applied it in multiple
contexts. At Nature, we are working on ways to increase the utility
of author contribution statements and so achieve such transparency.

Of course, it would help to know which John Smith we are talking
about. And here is where last year’s launch of the Open Researcher and
Contributor ID (ORCID) facility is to be welcomed. The core function
of ORCID, a community collaboration (see go.nature.com/sy3qnp),
is to assign every researcher a number and a web page,
thereby providing a unique identifier and so disambiguation. The web
page enables the researcher to record their contributions: papers they
have published and, as a facility to come, their research grants and
patents. Nature journal authors can link their ORCID to their account
in our manuscript submission and tracking system, and we will soon
be publishing authors’ ORCIDs in papers. (Readers can register for
ORCID here: https://orcid.org/register; see also Nature 485, 564; 2012.)

In contrast to such public activities, refereeing tends to be a private 
affair, whether for funding agencies or for journals. But it is of immense 
value and deserves its own credit. Referees can examine a submission 
only for its surface validity rather than for its deeper truth, but that in 
itself involves a substantial commitment. Some may devote days to the 
task if they are sufficiently stimulated or worried. The more that can be 
done to reward such dedication the better. 

That is why Nature and the Nature journals have introduced two 
ways in which referees can be given credit. Any referee who, in a given
year, has refereed three or more papers for any of the journals will 
receive a letter acknowledging their contribution and a free subscrip- 
tion to their choice of one of the journals. More importantly, we have 
recently introduced a system by which our referees can download a 
statement of the number of papers they have refereed for us. This report 
is available by logging into the ‘My Account’ page on any Nature journal’s
manuscript submission and tracking system and reflects the refereeing
activity across all Nature journals. If nothing else, such statements
provide a formal reference that someone can pass on to employers, 
government agencies and others enlightened enough to appreciate the 
value of such contributions. 

All of these developments are ways in which researchers can gain 
explicit credit for contributions that have previously relied more on 
word of mouth. This is a trend that we will continue to support and 
encourage.


Safety catch 


International laboratory survey offers 
comfort — and caution. 


In Lake Wobegon, the fictional town invented by the US humorist
Garrison Keillor, “all the women are strong, all the men are good-looking,
and all the children are above average”. In keeping with
Keillor’s gentle dig at the inflations of self-bias, if Lake Wobegon had
research laboratories you can be sure that all the experiments would
work, all the results would be significant and all the scientists
would work safely.

This week, Nature reports the initial analysis of results from the first
international survey of scientists’ attitudes and behaviour towards lab
safety, conducted by the University of California, Los Angeles, together
with Nature Publishing Group (see page 9). The analysis hints at a Lake
Wobegon bias in perceptions about safety: one-third of scientists say that
safety is more important to them than it is to their colleagues, with only
2% voting the other way. Although most respondents say that their labs
are safe places to work, they simultaneously report behaviour, such as
frequent lone working, that seems to belie that confidence.

The survey was done to improve understanding of lab safety 
culture. Health-and-safety officers have long complained of a lack of 
international data. It would be premature to draw immediate conclu- 
sions from the quantitative results — for example, almost half the 
respondents reported being injured in the lab — because few other 
comparable data have been collected. But the results do caution against 
complacency. 

So, as you return to your laboratories in the New Year, look around 
the benches, observe your own working practices and those of your 
colleagues, and evaluate your relationships with supervisors and safety 
officers. Not everyone can be above average — but awareness of how 
perception clashes with reality can help lift standards for all.
WORLD VIEW


Science must be seen to
bridge the political divide


Scientists in the United States are often perceived as a Democratic interest
group. For science’s sake this has to change, argues Daniel Sarewitz.


To prevent science from continuing its worrying slide towards
politicization, here’s a New Year's resolution for scientists, especially
in the United States: gain the confidence of people and
politicians across the political spectrum by demonstrating that science
is bipartisan.

That President Barack Obama chose to mention “technology, discovery
and innovation” in his passionate victory speech in November shows
just how strongly science has come, over the past decade or so, to be a
part of the identity of one political party, the Democrats, in the United
States. The highest-profile voices in the scientific community have avidly
pursued this embrace. For the third presidential election in a row, dozens
of Nobel prizewinners in physics, chemistry and medicine signed a letter
endorsing the Democratic candidate.

The 2012 letter argued that Obama would
ensure progress on the economy, health and the
environment by continuing “America’s proud
legacy of discovery and invention”, and that his
Republican opponent, Mitt Romney, would
“devastate a long tradition of support for public
research and investment in science”. The signatories
wrote “as winners of the Nobel Prizes in
Science”, thus cleansing their endorsement of the
taint of partisanship by invoking their authority
as pre-eminent scientists.

But even Nobel prizewinners are citizens with
political preferences. Of the 43 (out of 68) signatories
on record as having made past political
donations, only five had ever contributed to
a Republican candidate, and none did so in the
last election cycle. If the laureates are speaking on
behalf of science, then science is revealing itself,
like the unions, the civil service, environmentalists and tort lawyers,
to be a Democratic interest, not a democratic one.

This is dangerous for science and for the nation. The claim that
Republicans are anti-science is a staple of Democratic political rhetoric,
but bipartisan support among politicians for national investment in
science, especially basic research, is still strong. For more than 40 years,
US government science spending has commanded a remarkably stable
10% of the annual expenditure for non-defence discretionary programmes.
In good economic times, science budgets have gone up; in
bad times, they have gone down. There have been more good times than
bad, and science has prospered.

In the current period of dire fiscal stress, one way to undermine this
stable funding and bipartisan support would be to convince Republicans,
who control the House of Representatives,
that science is a Democratic special interest.

This concern rests on clear precedent. Conservatives
in the US government have long been
hostile to social science, which they believe tilts
towards liberal political agendas. Consequently, the social sciences
have remained poorly funded and politically vulnerable, and every so
often Republicans threaten to eliminate the entire National Science
Foundation budget for social science.

As scientists seek to provide policy-relevant knowledge on complex, 
interdisciplinary problems ranging from fisheries depletion and carbon 
emissions to obesity and natural hazards, the boundary between the nat- 
ural and the social sciences has blurred more than many scientists want 
to acknowledge. With Republicans generally sceptical of government's 
ability and authority to direct social and economic change, the enthusi- 
asm with which leading scientists align themselves with the Democratic 
party can only reinforce conservative suspicions that for contentious 
issues such as climate change, natural-resource 
management and policies around reproduction, 
all science is social science. 

The US scientific community must decide if 
it wants to be a Democratic interest group or if 
it wants to reassert its value as an independent 
national asset. If scientists want to claim that their 
recommendations are independent of their politi- 
cal beliefs, they ought to be able to show that those 
recommendations have the support of scientists 
with conflicting beliefs. Expert panels advis- 
ing the government on politically divisive issues 
could strengthen their authority by demonstrat- 
ing political diversity. The National Academies, as 
well as many government agencies, already try to 
balance representation from the academic, non- 
governmental and private sectors on many science 
advisory panels; it would be only a small step to 
be equally explicit about ideological or political 
diversity. Such information could be given voluntarily. 

To connect scientific advice to bipartisanship would benefit politi- 
cal debate. Volatile issues, such as the regulation of environmental and 
public-health risks, often lead to accusations of ‘junk science’ from 
opposing sides. Politicians would find it more difficult to attack 
science endorsed by avowedly bipartisan groups of scientists, and more 
difficult to justify their policy preferences by scientific claims that were 
contradicted by bipartisan panels. 

During the cold war, scientists from America and the Soviet Union 
developed lines of communication to improve the prospects for 
peace. Given the bitter ideological divisions in the United States today, 
scientists could reach across the political divide once again and set an 
example for all.


Daniel Sarewitz is co-director of the Consortium for Science, 
Policy and Outcomes at Arizona State University, and is based in 
Washington DC. 

e-mail: daniel.sarewitz@asu.edu 
SEVEN DAYS


POLICY 


Polio setback 


Pakistan has partly suspended 
a three-day vaccination 
campaign led by the Global 
Polio Eradication Initiative 
that focused on the country’s 
polio hotspots, after gunmen 
killed nine local health workers 
between 17 and 19 December 
last year. Pakistan is one of only 
three countries where polio 
has not been eradicated. On
1 January this year, seven health
workers and teachers working 
for a Pakistan charity were also 
killed by gunmen in northwest 
Pakistan. No organization has 
yet claimed responsibility for 
the shootings, but militants 
linked to the Taliban are 
suspected. See go.nature.com/ 
xzsc4s for more. 


Fisheries progress
Commercial fishing in Europe
is closer to being set on a
more scientific footing, after
members of the European
Parliament's fisheries
committee voted through
reforms to the European
Union's fisheries policies. The
changes include the stipulation
that levels for catches be
set on the basis of scientific
advice. The proposals must
next be voted on by the whole
parliament and then be agreed
on by ministers. See go.nature.com/hcm4yv
for more.


TREND WATCH


[Chart: Particle data pile up]

The Large Hadron Collider
(LHC) ended 2012 with a bang,
or rather 6 million billion billion
bangs. That's the number of
proton-proton collisions the
accelerator has produced since
it began running in 2010. Only
5 billion ‘collisions of interest’
were recorded, but that’s still
60 petabytes of data, enough
to fill the hard drives of around
80,000 laptop computers. The
LHC will remain closed until
2015 for an energy upgrade. That
will give physicists time to catch
up with their analysis.


TB drug approval
The US Food and Drug
Administration has for the
first time approved a medicine
to treat multidrug-resistant
tuberculosis (TB). Bedaquiline
was approved on 28 December
as part of a combination
therapy. Made by Janssen
Therapeutics of Titusville,
New Jersey, the drug works
by inhibiting a mycobacterial
ATP synthase enzyme. For
more on forthcoming TB
therapies, see page 14 and
Nature 487, 413-414 (2012).


Science in Korea
Park Geun-hye, elected South
Korea's first female president
on 19 December, has promised
to make science a cornerstone
of the government’s policy.
Park’s conservative Saenuri
party has also said that it will
increase spending on research
and development from 4%
(in 2011) to 5% of the nation’s
gross domestic product by
2017. The proportion of that
money going to basic research
will also rise from 35.2% to
40% over the same period,
Park said. See go.nature.com/on2Igh
for more.


EPA chief quits 


US Environmental Protection 
Agency (EPA) administrator 
Lisa Jackson will step down 
early this year after four years 
in the job, she announced
on 27 December. Jackson
championed tighter pollution 
standards and worked to rein 
in greenhouse-gas emissions. 
It is unclear who will take 
over her position, but deputy 
EPA administrator Robert 
Perciasepe is likely to step in 
until the next appointee is 
approved. See go.nature.com/ 
jzaiyn for more. 


Nobel laureate dies 
Rita Levi-Montalcini, a Nobel- 
prizewinning neuroscientist 
who became a national 
heroine in her home country 
of Italy, died on 30 December, 
aged 103. Levi-Montalcini 
shared the 1986 Nobel Prize 
in Physiology or Medicine for 
her work on the discovery of 
nerve growth factor. In her 
last decades she was made 
a senator for life in Italy’s
parliament, and created sparks 
by blocking legislation that 
might have been unfavourable 
to research. See go.nature. 
com/li2pjn for more. 


Microbiologist dies 
Carl Woese, a microbiologist 
who established that archaea 
are a third domain of life, died 
on 30 December, aged 84. In 
1977, Woese and colleagues
at the University of Illinois
at Urbana-Champaign
examined evolutionary 
relationships using ribosomal 
RNA sequences to show that 
prokaryotes comprise two 
different groups: bacteria and 
archaea. His theory was not 
accepted until the mid-1980s. 


RESEARCH


Antarctic lake flop


UK scientists have abandoned
their efforts to drill through
3 kilometres of ice to reach
the subglacial Lake Ellsworth
in Antarctica. The team from
the British Antarctic Survey
had planned to use high-pressure
hot water to bore
through the ice, but burned too
much fuel trying to connect
boreholes. The researchers
say that they will try again in
future Antarctic summers. See
go.nature.com/xcclen for more.
An international poll provides a lens into lab workers’ attitudes to workplace welfare.


WORKPLACE 


Safety survey 
reveals lab risks 


Questionnaire suggests researchers not as safe as they feel. 


BY RICHARD VAN NOORDEN 


Scientists may have a false sense of security
about the safety of their laboratories,
according to early results from the first
international survey of researchers’ workplace
attitudes and practices.

Some 86% of the roughly 2,400 scientists
who responded said that they believe their
labs are safe places to work. Yet just under
half had experienced injuries ranging from
animal bites to chemical inhalation, and large
fractions noted frequent lone working, unreported
injuries and insufficient safety training
on specific hazards (see ‘A question of safety’).

“Understanding this disparity will be key
to positively changing safety culture,” says
James Gibson, head of environmental health 
and safety at the University of California, Los 
Angeles (UCLA). The university’s Center for 
Laboratory Safety, a research initiative set up 
in March 2011, commissioned the study as 
part of a wave of US-led efforts to examine 
safety culture following the shocking death of 
a 23-year-old research assistant, Sheharbano 
Sangji. She received horrific burns in a UCLA
lab fire four years ago (see Nature http://doi.org/dnws3n;
2009), and her supervisor,
organic chemist Patrick Harran, may face a 
criminal trial over her death. Other incidents, 
including a second lab death, at Yale University
in New Haven, Connecticut, in 2011 (see
Nature 472, 270-271; 2011), have added to
the concerns.

The study “is the most comprehensive 
attempt at gathering data on attitudes to safety 
that I've seen — and one more piece of infor- 
mation in a growing body of reports that point 
to the need to improve the culture around 
safety in our academic laboratories,” says 
Dorothy Zolandz, director of the US National 
Academies Board on Chemical Sciences and 
Technology. Nature Publishing Group, the
publisher of Nature, helped to launch the sur- 
vey, as did the firm BioRAFT, which provides 
software for safety compliance and receives 
investment from Digital Science, a sister com- 
pany to Nature Publishing Group. UCLA's 
Center for Laboratory Safety plans to analyse 
the data more closely later this year, but shared 
early results with Nature. 


PART AND PARCEL 

Some of the anonymized survey participants 
— who were mostly from the United States and 
United Kingdom, but also hailed from Europe, 
China and Japan — felt that any injuries they 
sustained were just part of the job. “Was
scratched by a monkey,” one scientist wrote.
“It’s bound to happen in that line of work, no
matter how careful you are.” Another was bitten
while extracting venom from rattlesnakes;
a third reported being sprayed on the face and
hands with sulphuric acid, leading to US$3,000 
of dermatology treatments. The most com- 
mon injuries were minor — cuts, lacerations 
and needle pricks — but 30% of respondents 
said they had witnessed at least one ‘major’ lab 
injury, something that required attention from 
a medical professional. More than one-quarter 
of junior researchers said that they had expe- 
rienced an injury that they hadn't reported to 
their supervisor. 

Yet the overwhelming majority of respond- 
ents asserted that their labs were safe places to 
work, that they had received sufficient safety 
training to minimize injury and that appropri- 
ate safety measures had been taken to protect 
employees. This level of comfort is similar to 
that found in other, smaller surveys, says Ralph 
Stuart, secretary of the American Chemical 
Society’s health and safety division (which has 
conducted its own surveys on the matter). 

But more specific questions in the survey 
reveal that safety standards are often not 
adhered to. Only 60% said they had received 
safety training on specific hazards or agents
they worked with, and around half agreed that
lab safety could be improved, with chemists
(60%) most likely to feel this, and neuroscientists
(30%) significantly less so.


A QUESTION OF SAFETY

A survey of almost 2,400 scientists shows that although most believe their
laboratories to be safe, about half have experienced injuries in the workplace.
It also shows that junior and senior researchers have very different views of
potentially hazardous practices.

Panel: “In your lab, how frequently do people conduct …”


OLD VERSUS YOUNG 
One of the biggest gulfs picked up by the survey
was the difference in attitudes to safety between
those in junior roles (such as postdocs and PhD 
students) and those in more senior positions 
(such as professors, heads of department and 
principal investigators). Around 40% of junior 
scientists said that people worked alone in their 
lab every day — compounding the risk to health 
should an accident occur — compared with just 
26% of senior respondents (see graph 2), raising 
the possibility that supervisors are not always 
aware of the safety culture in their own group. 
Overall, about two-thirds of researchers said 
that people worked alone in their lab at least 
several times a week. And only 12% of younger 
scientists said that safety was “paramount, and 
takes precedence over all other lab priorities”,
compared with 36% of senior scientists. 
Younger researchers may have a clearer view 
of safety practices: controlling for other factors, 
junior researchers worked longer hours at the 
bench than their bosses. More than half of juniors
worked over 40 hours per week, compared with 
just one-fifth of seniors, with almost 150 people 
overall reporting more than 60 hours per week. 
Another finding — which comes as no sur- 
prise to health and safety experts — was the 
difference in how US and UK scientists assess 
risks before they start an experiment, which is, 
in part, a consequence of differences in legal
requirements. Almost two-thirds of British scientists
said that they used their organization's
approved form for risk assessments — which 
is mandated by the nation’s Health and Safety 
Executive — compared with only one-quarter 
of Americans. More than half of US scientists 
instead said they assessed risk “informally”. 

The biggest barriers to improving safety in 
the lab were “time and hassle” and “apathy”, 
scientists said. “If I could have selected apathy
three times over, I would have,” one scientist
wrote. These factors were closely followed by 
lack of understanding of safety requirements, 
lack of leadership and a focus on compliance 
requirements over safety. “Compliance does 
not equal to (sic) safety. More paperwork does 
not equal a safer lab; if anything, it makes it 
less safe,” wrote one researcher. Another complained:
“Safety training is very obviously
aimed at instituting blind compliance to avoid 
liability. It is not aimed at teaching lab workers 
about why each safety measure is put in place.” 

Those feelings might explain researchers’ 
mixed attitudes to the value of safety train- 
ing, inspections and safety rules. Two-thirds 
of those surveyed thought that lab inspections 
improved safety, with senior scientists signifi- 
cantly more likely to agree than juniors. Yet 
two-fifths felt that safety training “focused on 
training compliance regulations rather than 
on improving laboratory safety”, although 32%
disagreed. And close to one-fifth of research- 
ers said that lab safety rules had negatively 
impacted their lab productivity. 

“These respondents are wrong, and this is a 
reflection of an urban myth [about the value 
of safety procedures]; it is highly frustrating,”
comments Neal Langerman, who runs
the consulting company Advanced Chemical 
Safety, based in San Diego, California. 

Some health and safety experts think that 
the survey — which involved almost 100 ques- 
tions — was too broad and unfocused to draw 
definite conclusions. They also criticized its 
non-randomized sampling technique: the sur- 
vey was sent out by e-mail to scientists who had 
registered on nature.com, and to research lead- 
ers, who were encouraged to pass it on to their 
lab scientists. But the experts acknowledged 
that it was a necessary and useful starting point 
for further investigation. 

“This survey is a baseline study that leaves 
more questions than answers, but a perception 
survey is supposed to raise questions that need 
to be looked at,” said Lou DiBerardinis, head of
health and safety at the Massachusetts Institute 
of Technology (MIT) in Cambridge. DiBerar- 
dinis is on one of four teams to receive seed 
funding in 2012 from the Center for Labora- 
tory Safety to study safety. He is working on a 
project led by MIT anthropologist Susan Silbey 
to track changing safety cultures by monitor- 
ing inspection records over seven years. 

Zolandz says that this year, the National 
Academies Board on Chemical Sciences and 
Technology will team up with behavioural 
scientists to develop practical guidance for 
researchers on how to establish a better safety 
culture. In the various efforts that have followed 
Sangji’s death, “that’s one piece of the puzzle 
that’s been missing”, she says. “How do you get
people to buy into safety?” SEE EDITORIAL P.5


SOURCE: CENTER FOR LABORATORY SAFETY, UCLA/NPG/BONAMY FINCH 
New year, new science


Nature looks ahead to the key findings and events that may emerge in 2013. 


STEM-CELL TRIALS 
Landmark results from an early-stage clini- 
cal trial using human embryonic stem cells 
(hESCs) should appear this year. Biotechnol- 
ogy firm Advanced Cell Technology of Santa 
Monica, California, is injecting hESC-derived 
retinal cells into the eyes of around three dozen 
people with two forms of non-treatable degen- 
erative blindness. It is the only company cur- 
rently testing hESC therapies with US Food 
and Drug Administration (FDA) approval, and 
it hopes that the agency will give it the green 
light to test stem cells induced from adult 
cells in patients this year. 


DIAGNOSTICS CONTROVERSY 

The American Psychiatric Association will 
publish the fifth edition of its Diagnostic 
and Statistical Manual of Mental Disorders 
(DSM-5) in May, the first major update in 
19 years to the standard reference guide 
for diagnosing mental illnesses. It will lead 
to controversial changes in clinical and 
research protocols, including restructured 
diagnoses for autism and major depres- 
sion, although as a ‘living document’ the 
DSM-5 will see further revisions. 


CLIMATE ASSESSMENT 

Climate scientists have spent years pre- 
paring the fifth assessment report from 
the Intergovernmental Panel on Climate 
Change, its first update since 2007. Part of 
that report is due to appear in September: the 
conclusions of Working Group I, which sum- 
marizes the basic science of global warming. In 
the United States, the Global Change Research 
Program’s second assessment will detail the 
national impacts of climate change. 


THE BIG BANG’S GLOW 
One of the stunning images of the year could 
be provided by the comet ISON, which will 
pass close to the Sun in November and could 
outshine the full Moon as its surface boils away 
into space. Just as spectacular will be the Planck 
space telescope’s map of the faint microwave 
afterglow from the Big Bang, which could even 
reveal ripples from gravitational waves generated
during an initial period of cosmic ‘inflation’.
In other missions, NASA’s LADEE spacecraft
will orbit the Moon to study lunar dust; the agency’s
MAVEN mission will launch to explore Mars’
upper atmosphere; and the Curiosity rover will
continue to send back results from the red
planet’s surface. Back on Earth, the massive
66-dish Atacama Large Millimeter/submillimeter
Array in Chile will be completed.


DIET, MICROBES AND CANCER 

Scientists increasingly suspect that our intes- 
tinal zoo of microbes might be the key link 
between diet and diseases such as cancer. A 
study last year connected a higher-than-normal
proportion of the bacterium Escherichia
coli to colorectal cancer in mice with inflammatory
bowel disease (J. C. Arthur et al. Science
338, 120-123; 2012). More studies this
year will unpick the effect of diets on the gut
microbiome and their implications for disease
risk. Meanwhile, GlaxoSmithKline should find
out whether the FDA approves its melanoma
treatment trametinib, potentially the first in a
new class of compounds that inhibit a kinase
signalling pathway regulating cell growth.


The Large Underground Xenon detector is seeking dark matter.


PARTICLE SEEKING 

After contradictory sightings of dark-matter 
particles from various underground experi- 
ments, the Large Underground Xenon detector 
at the Sanford Underground Research Facility 
in Lead, South Dakota, may this year boost or 
rule out some of the claims. The king of particle 
hunters — the Large Hadron Collider at CERN 
near Geneva — will shut down until 2015 for an 
upgrade to enable more powerful collisions, but 
physicists will continue to pore over the data 
collected so far for hints of supersymmetry. 


THE LOWER DEPTHS 
Data will start flowing from the first completed 
segments of a giant underwater surveillance 
network, the US Ocean Observatories Initiative,
which will cost US$386 million to build and will 
be completed by March 2015. It will monitor 
everything from undersea earthquakes and the 
effects of climate change on ocean circulation, to 
shifting ecosystems and ocean chemistry — all 
the way from the air to the seabed at seven sites 
around the globe. Meanwhile, British, Ameri- 
can and Russian teams will be hoping to find 
out what kind of life, if any, exists in three deep,
subglacial Antarctic lakes. 


MAGICAL MATERIALS 

Samarium hexaboride might be the next 
star of materials science, following hints 
last year that it is a topological insulator 
— conducting electricity on its surface, 
but behaving as an insulator inside. Gra- 
phene will remain a major celebrity, so 
expect a flood of reports about copycat 
materials such as boron nitride, tantalum 
disulphide and other two-dimensional 
sheets that can be stacked or sandwiched 
in precise layers. 


GENES IN COURT 

The US Supreme Court could decide a 
number of cases with science implica- 
tions in 2013. It will re-examine whether 
genes are patentable as part of a three- 
year lawsuit considering the validity of 
patents held by Myriad Genetics in Salt Lake 
City, Utah. It may also rule on a challenge to 
seed firm Monsanto, based in St Louis, Mis- 
souri, from a farmer who wants to plant seeds 
gathered from previously grown genetically 
modified crops, rather than buying new stock 
from the company. And the court will consider 
whether brand-name pharmaceutical com- 
panies can pay generic makers to delay their 
launch of generic drugs. 


PAPER MONEY 

A UK policy that requires publicly funded 
researchers to make their results freely avail- 
able will take effect from April. Other countries 
could soon follow — a Global Research Council 
meeting is set to discuss the matter in May. But 
many scientists will be worrying more about 
budgets, with the United States considering 
drastic spending cuts that could take effect 
early this year (see page 13), and Europe set 
to continue debating the proposed €80 billion 
(US$104 billion) in funding for its 2014-20 
research programme, Horizon 2020.


Methane leaks erode green
credentials of natural gas


Losses of up to 9% show need for broader data on US gas industry’s environmental impact. 


BY JEFF TOLLEFSON 


Scientists are once again reporting alarmingly high methane emissions from an
oil and gas field, underscoring questions
about the environmental benefits of the boom 
in natural-gas production that is transforming 
the US energy system. 

The researchers, who hold joint appoint- 
ments with the National Oceanic and 
Atmospheric Administration (NOAA) and 
the University of Colorado in Boulder, first 
sparked concern in February 2012 with a 
study¹ suggesting that up to 4% of the methane
produced at a field near Denver was escaping 
into the atmosphere. If methane — a potent 
greenhouse gas — is leaking from fields across 
the country at similar rates, it could be offset- 
ting much of the climate benefit of the ongoing 
shift from coal- to gas-fired plants for electric- 
ity generation. 

Industry officials and some scientists con- 
tested the claim, but at an American Geophysi- 
cal Union (AGU) meeting in San Francisco, 
California, last month, the research team 
reported new Colorado data that support the 
earlier work, as well as preliminary results from 
a field study in the Uinta Basin of Utah sug- 
gesting even higher rates of methane leakage 
— an eye-popping 9% of the total production. 
That figure is nearly double the cumulative loss 
rates estimated from industry data — which 
are already higher in Utah than in Colorado. 

“We were expecting to see high methane lev- 
els, but I don’t think anybody really compre- 
hended the true magnitude of what we would 
see,” says Colm Sweeney, who led the aerial
component of the study as head of the aircraft
programme at NOAA’s Earth System Research
Laboratory in Boulder. 

Whether the high leakage rates claimed in 
Colorado and Utah are typical across the US 
natural-gas industry remains unclear. The 
NOAA data represent a “small snapshot” of 
a much larger picture that the broader sci- 
entific community is now assembling, says 
Steven Hamburg, chief scientist at the Envi- 
ronmental Defense Fund (EDF) in Boston, 
Massachusetts. 

Natural-gas wells such as this one in Colorado are increasingly important to the US energy supply.

The NOAA researchers collected their data in February as part of a broader analysis of air pollution in the Uinta Basin, using ground-based equipment and an aircraft to make detailed measurements of various pollutants, including methane concentrations. The researchers used atmospheric modelling to calculate the level of methane emissions required to reach those concentrations, and then compared that with industry data on gas production to obtain the percentage escaping into the atmosphere through venting and leaks.
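The final step described above, turning an emissions estimate and a production figure into a percentage, can be sketched as follows. This is only an illustration of the arithmetic: the function name and the numbers are hypothetical and are not taken from the NOAA analysis, and the atmospheric modelling that produces the emissions estimate is not represented.

```python
def leakage_percent(emitted_methane, produced_gas):
    """Return the share of production lost to the atmosphere, in percent.

    Both arguments must be in the same units (for example, tonnes of
    methane per hour). The names are hypothetical, chosen for illustration.
    """
    if produced_gas <= 0:
        raise ValueError("produced_gas must be positive")
    return 100.0 * emitted_methane / produced_gas


# Made-up illustrative values: 9 units escaping for every 100 produced.
example_rate = leakage_percent(emitted_methane=9.0, produced_gas=100.0)
```

The estimate depends on both inputs: an error in the modelled emissions or in the industry production data shifts the resulting percentage by the same proportion.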

The results build on those of the earlier Colorado
study¹ in the Denver–Julesburg Basin,
led by NOAA scientist Gabrielle Pétron (see 
Nature 482, 139-140; 2012). That study relied 
on pollution measurements taken in 2008 
on the ground and from a nearby tower, and 
estimated a leakage rate that was about twice 
as high as official figures suggested. But the 
team’s methodology for calculating leakage — 
based on chemical analysis of the pollutants 
— remains in dispute. Michael Levi, an energy 
analyst at the Council on Foreign Relations in 
New York, published a peer-reviewed comment²
questioning the findings and presenting
an alternative interpretation of the data that 
would align overall leakage rates with previ- 
ous estimates. 


© 2013 Macmillan Publishers Limited. All rights reserved 


Pétron and her colleagues have a defence of
the Colorado study in press³, and at the AGU
meeting she discussed a new study of the
Denver–Julesburg Basin conducted with scientists
at Picarro, a gas-analyser manufacturer based 
in Santa Clara, California. That study relies 
on carbon isotopes to differentiate between 
industrial emissions and methane from cows 
and feedlots, and the preliminary results line 
up with their earlier findings. 

A great deal rides on getting the number 
right. A study⁴ published in April by scientists
at the EDF and Princeton University in New 
Jersey suggests that shifting to natural gas 
from coal-fired generators has immediate cli- 
matic benefits as long as the cumulative leak- 
age rate from natural-gas production is below 
3.2%; the benefits accumulate over time and 
are even larger if the gas plants replace older 
coal plants. By comparison, the authors note 
that the latest estimates from the US Environ- 
mental Protection Agency (EPA) suggest that 
2.4% of total natural-gas production was lost 
to leakage in 2009. 

To see if that number holds up, the NOAA 
scientists are also taking part in a comprehen- 
sive assessment of US natural-gas emissions, 
conducted by the University of Texas at Austin 
and the EDF, with various industry partners.
The initiative will analyse emissions from 
the production, gathering, processing, long- 
distance transmission and local distribution 
of natural gas, and will gather data on the use 
of natural gas in the transportation sector. In 
addition to scouring through industry data, 
the scientists are collecting field measure- 
ments at facilities across the country. The 
researchers expect to submit the first of these 
studies for publication by February, and say 
that the others will be complete within a year. 

In April, the EPA issued standards intended 
to reduce air pollution from hydraulic-frac- 
turing operations — now standard within the 
oil and gas industry — and advocates say that 
more can be done, at the state and national lev- 
els, to reduce methane emissions. “There are 
clearly opportunities to reduce leakage,” says 
Hamburg.

1. Pétron, G. et al. J. Geophys. Res. 117, D04304 (2012).
2. Levi, M. A. J. Geophys. Res. 117, D21203 (2012).
3. Pétron, G. et al. J. Geophys. Res. (in the press).
4. Alvarez, R. A., Pacala, S. W., Winebrake, J. J., Chameides, W. L. & Hamburg, S. P. Proc. Natl Acad. Sci. USA 109, 6435-6440 (2012).


US fiscal deal leaves
science vulnerable 


Congress delays mandatory cuts to agencies. 


BY HEIDI LEDFORD 


Leaders in Washington DC greeted the new year with a frantic deal meant
to avert a fiscal crisis. But the bill that
passed the Senate and the House in pre-dawn 
votes on 1 and 2 January keeps researchers on 
tenterhooks for at least another two months by 
delaying mandatory spending cuts that could 
threaten science funding. 

The last-ditch effort aimed to stave off the 
effects of the ‘fiscal cliff’: a painful series of 
tax hikes and budget cuts, scheduled to take 
effect in 2013, that is meant to reduce the US 
budget deficit but could push the country’s 
weak economy back into recession. The cuts, 
known as the sequester, could shrink fed- 
eral support for research and development 
(R&D) by US$57.5 billion over the next five 
years. 

Rancorous last-minute negotiations yielded 
a tenuous agreement to raise taxes for the 
wealthy, but deferred decisions on the sequester
(an across-the-board reduction of about
8% in nondefence discretionary spending,
with at least 9% carved from defence) by
two months. “The bill isn’t ideal,” says Eleanor 
Dehoney, vice-president of public policy at 
Research!America, a science-advocacy group 
based in Alexandria, Virginia. “But it does 
give advocates more time to convince policy- 
makers that cutting the US investment in R&D 
is counterproductive.” 

Federal agencies such as the National 
Institutes of Health (NIH) and the National 
Science Foundation have not announced 
what cuts they would make in the event of a 
sequester, but many institutions have drawn 
up contingency plans in case there is a sud- 
den plunge in government funding. Nancy 
Andrews, dean of the Duke University School 
of Medicine in Durham, North Carolina, says 
that the medical school may need to cut back 
on graduate admissions, freeze salaries and
reduce faculty recruit-
who is wrestling with decisions on faculty
retention and recruitment that must be made 
by mid-January. “If we need to spend more to 
help current faculty members maintain their 
research programmes through funding gaps, 
it will be harder to provide start-up funding 
for new faculty members,” she says. 

The delay means that law-makers will 
debate the sequester at the same time as they 
tackle the overall federal budget. The US Con- 
gress’s inability to agree on a 2013 budget last 
year led it to adopt
a continuing reso-
also have reached a
cap on the amount
of money that it can
borrow.

This timing may increase the momentum 
behind big spending cuts, cautions Michael 
Lubell, director of public affairs for the Ameri- 
can Physical Society in Washington DC. But it 
may also foster a more thorough discussion of 
which programmes should be cut and by how 
much, he adds. Science is unlikely to emerge 
unscarred, says Lubell, but it may end up in 
better shape than it would have under the orig- 
inal sequestration plan. “That doesn’t mean it’s 
going to be wonderful,” he adds. “It means it 
will be less bad.”



GEORGE MARSHALL 


Maxine Clarke 


(1954-2012) 


Publishing Executive Editor of Nature. 


For those of us who were lucky enough to have Maxine as a colleague, it is so difficult to believe that she is no longer with us. She died last month after a battle with cancer that lasted several years. It was a battle fought with great personal courage and resilience, as well as with immense professionalism.

Given the history of her illness, Maxine’s death should not have come as much of a surprise. Yet, despite knowing about her challenges, I and her closest colleagues experienced shock and disbelief on hearing that she had passed away. That, in itself, reflects her steadfast refusal to be defined by her illness. She instead maintained her active commitment to Nature and her colleagues, who equally refused to conceive that anything could deprive them of her counsel.

Maxine joined Nature in 1984, under the editorship of John Maddox, having been a researcher in the biophysics of muscle contraction. During her time at Nature, she held a variety of important posts before assuming her final role as Publishing Executive Editor.

Throughout her career she displayed a passionate commitment to Nature, focusing particularly on its scientific standards, its championship of intellectual integrity, the quality of its text, visual presentation, production standards and workflows, and the management of its staff. She never ceased to care and advocate for the working lives of those whom she managed. But, as I know from unsolicited compliments from authors over the years, she was highly valued outside the office too.

As a senior colleague put it: “Maxine was Nature through and through.” Having been thankful for her many insights over the years, my colleagues and I will miss her greatly.


Philip Campbell, Editor-in-Chief, Nature.


A chest X-ray from a patient with tuberculosis (TB) in Lira, Uganda. Uganda is one of 22 countries accounting for roughly 80% of new TB cases each year.


TB’S REVENGE


The world is starting to win the war against tuberculosis, 
but drug-resistant forms pose a new threat. 


BY LEIGH PHILLIPS


J. MATTHEWS/PANOS 


If there was any doubt that tuberculosis
(TB) was fighting back, it was dispelled
in 2005, at the Church of Scotland Hospital
in the village of Tugela Ferry, South
Africa. Doctors at the hospital, in a rough, 
remote corner of KwaZulu-Natal province, 
were hardened to people dying from gunshots 
and AIDS. But even they were puzzled and 
frightened when patients with HIV who were 
responding well to antiretroviral drugs began 
dying — rapidly — from TB. 

With ordinary TB, patients start to feel 
better after a few weeks or months on a
selection of four mainstay antibiotics. But of
the 542 people with TB at the hospital in 
2005 and early 2006, 221 (41%) had a multi- 
drug-resistant (MDR) form, against which 
these therapies are mostly powerless. Worse, 
53 of them did not even respond to the few
antibiotics that form a second line of defence.
Eventually, doctors had nothing left to try: all 
but one of the 53 died, half of them within 16 
days of diagnosis. It was the first major out- 
break of what became known as extensively 
drug-resistant (XDR) TB — and a wake-up 
call to the world that TB had taken a turn for 
the worse’. 

In the early 1980s, TB cases had dropped 
to such low rates that Western policy-makers 
frequently talked of eradication of the disease. 
Then came the HIV epidemic, which triggered 
a resurgence of TB in the late 1990s. But the 
latest report on TB from the World Health 
Organization (WHO), published in October, 
revealed signs of progress against normal — or 
drug-sensitive — cases of the bacterial disease. 
New infections have fallen and the mortality 
rate has dropped by 41% since 1990. But, the
report warned, “drug-resistant TB threatens
global TB control”. Some 3.7% of new cases
and 20% of previously treated cases are MDR-TB.
And whereas in 2000 the highest incidence
of MDR-TB was 14%, in Estonia, in 2010 that
figure had jumped to 35%, in Russia’s Arkhangelsk
province. An estimated 9% of drug-resistant
cases are XDR-TB, which has now
been reported in 84 countries.

It is a tale of two TBs. Once detected, 
drug-sensitive TB is almost always treatable, 
as long as the appropriate drugs are provided 
and taken. Simple practices — such as check- 
ing that patients take their medicine — can 
be transformative. But in some countries, par- 
ticularly in eastern Europe, Asia and Africa, 
the weakening or collapse of health-care 
systems over the past two decades has meant 
that patients do not always finish their drugs, 
or they take the wrong ones, allowing highly 
transmissible, drug-resistant strains to emerge 
and spread. 

Drug-resistant TB is harder, more expensive 
and more time-consuming to treat. New tools 
are needed — but there have been no new anti- 
TB drugs in more than 50 years, and the current 
vaccine is largely ineffective. The most common 
diagnostic technique — analysing sputum 
samples under a microscope — can determine 
that Mycobacterium tuberculosis bacteria are 
present but not whether they are drug resistant. 
Meanwhile, researchers have lacked interest in 
developing drugs and tests, and drug companies 
have lacked market incentives to do so. 

The growth of multi-drug resistance is an
“escalating public-health emergency”, says
Grania Brigden, TB adviser for Médecins
Sans Frontières (Doctors Without Borders) in
Geneva, Switzerland: “With barely 1 in 20 TB
patients being tested for drug resistance, we're
just seeing the tip of the iceberg.”

But scientists are careful to temper their 
alarm. In the past decade, researchers and 
policy-makers have fought for and won a rever- 
sal in funding and attention for TB. Several 
new drugs are in development, and progress is 
being made towards an effective vaccine. 

“I do worry when people stand up at 
conferences and talk about MDR-TB and 
say it’s a big disaster and the whole world is 
going to collapse. It’s not that severe yet,” says 
Tim McHugh, head of the Centre for Clinical 
Microbiology at University College London, 
who leads a team that is trialling one of the two 
most advanced candidates for new TB drugs. 
“The big anxiety is that if we don’t act now, it 
will easily run away from us.” 


RISE AND FALL 
TB is one of the world’s leading killers, stealing 
1.4 million lives and causing 8.7 million new 
and relapse infections in 2011. One-third of 
the world’s population carries the bacterium, 
but most will never develop the active form of 
the disease. 

The first modern TB epidemic took off
in the late 1700s, during the Industrial
Revolution. Rural workers in Europe and 
North America moved in droves to cities, 
where poverty — and related malnutrition 
and overcrowding — created an ideal environ- 
ment for the disease’s spread. But as hygiene, 
nutrition and medicine improved, what was 
known as the Great White Plague began to ebb. 

“By the 1940s and 50s, things looked quite
bright,” says McHugh, who seems almost as 
interested in the history of TB as in its micro- 
biology. The Bacillus Calmette-Guérin (BCG) 
vaccine, first used in the 1920s, helped. But 
BCG is now effective mainly against child- 
hood TB, which is not infectious, rather than 
the adult form. What really broke TB’s back 
was the introduction of isoniazid, in 1952, and 
then rifampicin, in the 1970s. “If you look at a 
graph of TB from the 1950s onward, [infection 
rates] just collapsed,” McHugh says. 

Then, in the 1980s and 1990s, HIV hit. “You 
can’t underestimate the importance of HIV,” 
McHugh says. A co-infection of TB and HIV 
produces a powerful biological synergy, accel- 
erating the breakdown of the body’s immune 
defences; latent TB infection is 20-30 times 
more likely to become active in people who 
have HIV. In 1993, the WHO declared TB a 
global emergency. Worldwide, TB is now the 
leading cause of death among people with HIV. 

The resurgence of ordinary TB set the stage for drug-resistant forms. Resistance develops when people do not stick to their drug regimens (which typically last six months for drug-sensitive TB and 20 months for MDR-TB), allowing naturally occurring resistant mutants to grow and evolve. MDR-TB, which grew more threatening during the 1990s, is resistant to isoniazid and rifampicin. People with this form require second-line drugs: broad-spectrum antibiotics called fluoroquinolones or injectable agents (amikacin, capreomycin and kanamycin). These treatments are less effective, more toxic and take many more months to work than first-line therapies. An infection is classified as XDR-TB if it is also resistant to fluoroquinolones and at least one of the injectables. The XDR-TB outbreak at Tugela Ferry, when it was reported in 2006, rattled the TB research and policy community.

Experts agree that the biggest driver for the growth in drug-resistant TB has been the deterioration in some countries’ health-care infrastructures, including TB programmes, since the 1990s, particularly in the former Soviet bloc. This decline has meant that patients are not diagnosed and treated; and in some countries, over-the-counter availability of anti-TB drugs also encourages people to take inappropriate second-line therapies, accelerating the growth of drug resistance.

In a stroke of bad luck, the virulent and often drug-resistant ‘Beijing’ strain of TB, identified in 1995 in China, swept through Russia and eastern Europe just as the region's public-health provision was being dismantled. “There was a confluence of the biology of the organism and its progress into Russia, where lots of people had a health-care system that was collapsing around their ears,” says McHugh. (In 2010, the Beijing strain was found in around 13% of active TB infections worldwide.) All of this helps to explain why the recent WHO report shows the highest burden of MDR-TB in Russia's Arkhangelsk province and in Belarus, Estonia, Kazakhstan, Kirghizia and Moldova (see ‘Two faces of TB’).


FIGHTING BACK

Over the past decade or so, the paths of drug-sensitive and drug-resistant TBs have diverged. The solution for drug-sensitive TB is simply to deliver the drugs and diagnostics to patients, and the drive to do so has grown. One of the United Nations Millennium Development Goals set in 2000 was to halt and begin to reverse the incidence of TB by 2015; in 2001, the international Stop TB Partnership was established, bringing together government programmes, researchers, charitable foundations, non-governmental organizations (NGOs) and the private sector.

One major result of these and other efforts was a global expansion of ‘directly observed treatment, short course’ (DOTS), a strategy promoted by the WHO to combat drug-sensitive TB. Once diagnosed, the disease is treated with a supply of first-line drugs, taken under the close observation of health-care workers to ensure that people finish the course. Thanks in large part to such efforts, the WHO says that the world is on track to halve TB mortality from 1990 levels by 2015.

Tackling drug-resistant TB, however, will 
require not just the rebuilding of health- 
care infrastructure, but also new weapons, 
such as diagnostics, drugs and vaccines. The 
private sector has had little incentive to invest 
in basic research whose eventual products, if 
any emerge, would largely be sold at low cost 
in poor countries. “They have to look at the 
bottom line,” says Anthony Fauci, head of the
US National Institute of Allergy and Infectious
Diseases in Bethesda, Maryland.

3 JANUARY 2013 | VOL 493 | NATURE | 15

TWO FACES OF TB
Efforts to tackle drug-susceptible strains have started to cut the global incidence of TB (left), but collapsing health-care systems in the former Soviet bloc have helped drug-resistant strains to emerge and spread.

In the past decade or so, global TB programmes have pumped money into research.
One major development came in 1998,
when researchers at the Wellcome Trust
Sanger Institute in Hinxton, UK, published
the genome sequence of M. tuberculosis,
allowing researchers to identify and study
genes that underlie the bacterium’s virulence 
and ability to evade the immune system’. In 
2012, the US National Institutes of Health in 
Bethesda started a bigger genome-sequencing 
project that aims to uncover the genetic roots 
of drug resistance. “We'll use next-generation 
sequencing technologies to sequence 1,000 TB 
clinical isolates from around the world — South 
Africa, Korea, Russia, Uganda — anywhere 
drug-resistant TB is heavily present,” says Fauci. 


WAR CHEST 

There are now ten TB drugs in clinical trials. The aim is to find compounds that are effective against resistant strains and that work faster and have fewer side effects, so that patients will be more likely to finish the course. McHugh and his team, for example, are running a clinical trial at sites across Africa and Asia to test the antibiotic moxifloxacin, which is commonly used for pneumonia and skin infections. (They expect to release preliminary results in 2013.) The researchers are also working to speed up the screening process for potential drugs by using mycobacterial species that are less pathogenic and more fecund than M. tuberculosis, which is slow-growing, finicky and poses a biosecurity risk. “Previously, what you had was a chemist who says ‘I’ve got this molecule that will kill Escherichia coli. I’m fairly sure it should kill TB. But there’s nowhere I can see if it does’,” McHugh says.

Accurate and fast diagnostic tests for drug-resistant strains are also a key part of the fight, and a number of tests have come online in the past five years. One, called GeneXpert, takes 90 minutes to complete and is based on a gene-amplification technique that detects DNA sequences specific to M. tuberculosis and to rifampicin resistance. The system has been endorsed by the WHO and subsidized by a coalition of organizations, but researchers are still seeking simpler, cheaper options.

Only better vaccines will solve the problem for good. “We must invest in vaccine research if our ultimate goal is to be able to prevent the disease rather than forever chase growing drug resistance,” says Helen McShane, a vaccine researcher at the University of Oxford, UK.

In 2008, the European Commission pushed for the creation of the TB Vaccine Initiative, which draws funding from European countries, NGOs and private funders. These and other efforts have helped to boost the number of vaccine candidates from 0 to 12 since 2000.

McShane and her team are on the cusp of the first efficacy results for MVA85A, one of the most clinically advanced TB vaccines in the pipeline at present. The shot, which McShane helped to develop as a PhD student 15 years ago, contains a virus designed to ramp up the activity of T cells that have already been primed by BCG. In 2009, in partnership with the South African Tuberculosis Vaccine Initiative, McShane launched a major phase II clinical trial on nearly 3,000 BCG-vaccinated babies in South Africa; early results are expected in the first quarter of 2013. In parallel, she and her colleagues are also testing the vaccine’s efficacy in HIV-infected adults in South Africa and Senegal.

But are these efforts enough? “Unfortunately 
not,” concludes Karin Weyer, coordinator of 
laboratories, diagnostics and drug resistance 
at the WHO Stop TB Department in Geneva. 
Annual funding for TB diagnosis and treat- 
ment is expected to reach some US$4.8 billion 
in 2013 — but TB care and control are expected 
to demand up to $8 billion a year by 2015. The 
$600 million contributed to TB research in 
2010 also falls well short of the $2 billion the 
WHO estimates will be needed annually — 
and the economic crisis has slowed financing 
across the board. “I need to be and want to be 
optimistic,” says Weyer. “But we're still working 
with shoestring budgets compared to HIV.” 

Meanwhile, the bacterium is not resting. In 
December last year, clinicians in Mumbai, India, 
reported’ the identification of 12 patients with 
what they termed totally drug-resistant TB, or 
TDR-TB. Similar claims had been made a few 
years earlier in Italy and Iran, but this time the 
WHO took it seriously enough to investigate. In 
March 2012, 40 experts convened by the WHO 
concluded that there was not enough evidence 
to say that TDR-TB was substantially different 
from XDR-TB. 

McHugh agrees. But he does not need fur- 
ther evidence to act. In the face of marching 
drug resistance, it is the responsibility of 
researchers to speak out, he says. “I think we 
can no longer be scientists in our labs doing 
fascinating stuff and think we're doing good 
work. We have to evangelize a little bit too.”


Leigh Phillips was an International 
Development Research Council fellow with 
Nature until October 2012.


Improve weather forecasts
for the developing world 


Global prediction partnerships would cost little and reduce the regional carnage 
caused by floods, droughts and tropical cyclones, argues Peter J. Webster. 


Hurricane Sandy hit the northeast
coast of the United States in October
with ample warning. The storm
caused widespread damage, but only around 
125 people died in the region, thanks to 
planning made possible by accurate long- 
range weather forecasts. 

In the developing world, tropical cyclones, 
floods and droughts arrive with little notice 
and kill thousands of people each year. 
Although only 5% of tropical cyclones occur 
in the north Indian Ocean, they account for 
95% of such casualties worldwide’. In 2007 
and 2008, two Very Severe Cyclonic Storms
(Sidr and Nargis) caused the deaths of
more than 10,000 and around 138,000 people
in Bangladesh and Myanmar, respectively. 
Flooding in the Ganges and Brahmaputra 
river basins has displaced more than 40 mil- 
lion people in each of the past few years”. 
In 1998, 60% of Bangladesh was inundated 
with floodwater for almost three months. 
Pakistan’s Indus Valley was devastated by 
floods in 2010, costing more than 2,000 lives 
and US$40 billion’. Flooding struck the area 
again in the summers of 2011 and 2012. 
Droughts condemn millions to hunger 
across the developing world. A three-week 
break in rainfall just after seasonal planting,
following what seemed to be a normal
monsoon onset, caused a disastrous crop
failure in India in 2002 (ref. 6). The unfore- 
seen dry period was tied to a south Asian 
weather oscillation that occurs every 
30-60 days’. 

An individual living in south Asia or 
Africa can expect to encounter several 
extreme weather events in his or her lifetime. 
Because the resilience of poor populations is 
low and falls with every crisis, the cumula- 
tive effects are relentlessly impoverishing. 
Smallholders often purchase stocks on credit 
that is repaid at the end of the season, so the 
loss of a crop or livestock in one bad year can 
put the farmer into debt for many years,
condemning generations to poverty.

Owing to advances in prediction science, 
such catastrophes can be forecast anywhere 
in the world with as long a lead time as 
Hurricane Sandy. The problem is tailoring 
complex global forecasts to a country or 
region and communicating them to local 
populations so that they can take action. 

To lessen the impacts of adverse weather, 
networks must be established between the 
forecasters of global weather and climate in 
the developed world, and research, govern- 
mental and non-governmental organizations 
in the less-developed world. The investment 
needed is not high and will pay for itself: my 
research group at the Georgia Institute of 
Technology estimates that such a network 
could forecast floods across Asia for as little 
as $1 million a year, saving billions of dollars 
and thousands of lives. 


NO WARNING 

In most developing countries, weather 
warnings are issued a few days in advance, 
if at all. Yet, for a flood or cyclone, at least 
a week of forewarning is needed to allow 
the slowest members of a society 
(perhaps a farmer and his cattle) to 
evacuate. For short droughts, several
weeks’ notice allows smallholders to
adjust planting and harvesting sched- 
ules. Long droughts require warnings 
to be given months ahead, so that 
farmers can choose resistant crops and 
store fodder and water. 

Although Pakistan’s devastating 
2010 floods arrived with no warning 
in the north of the country, the pulses 
of intense rain responsible could have 
been forecast 8-10 days ahead if avail- 
able data had been analysed at the time’. 
In September 2012, my research group 
used rainfall forecasts and a regional 
hydrological model to predict flooding 
ten days in advance, but Pakistan's gov- 
ernment didn’t implement the warning. 
Fortunately, the floods were less severe 
than those in 2010. 

Regional forecasts with long lead 
times must take global atmospheric 
circulations into account, because local 
weather is influenced by distant events. 
The path of Hurricane Sandy, for exam- 
ple, was swayed by a mid-latitude low- 
pressure system in the Pacific Ocean 
that moved slowly across Canada, 
thousands of kilometres to the west. 

Global weather-forecast models take 
decades to develop, are expensive to build 
and maintain, and are run by only a few 
national or multinational government 
organizations, including the European Centre
for Medium-Range Weather Forecasts
(ECMWF), the UK Met Office and the US
National Centers for Environmental Prediction
(NCEP).
BANGLADESH FLOOD ALERTS

Bangladesh’s floods of July-August 2007 were predicted
in advance. Forecasts were communicated to five pilot areas (red),
where community leaders directed people to harvest crops,
move to safety and store water, food and personal belongings.


Each run of a model begins with a huge 
array of meteorological and oceanographic 
data from more than 30,000 observations on 
land, 3,500 floating buoys and drifters, and 
numerous satellites. Even so, the global data 
coverage is incomplete (particularly in the 
Southern Hemisphere), the measurements 
have errors and the algorithms are imper- 
fect. Global models are thus run many times 
a day using different initial conditions.
The ECMWF, for example, runs its model
51 times twice a day, incorporating new initial
data to produce 1-15-day forecasts, and
extends its forecasting horizon to 32 days 
ahead twice a week. 

In theory, developing countries can access 
these data streams. The NCEP forecasts, for 
example, are posted on the Internet daily. But 
peeling off regional data from the global del- 
uge is like filling a cup from a fire hose. Inter- 
net transmission costs are high, and timely 
downloading requires a fast data-transfer rate. 
Less-developed countries have small budgets 
and slow Internet connections. Paradoxically, 
as forecasts become better and their resolution
grows, it becomes more difficult for
developing countries to access them.

Figure note (‘Bangladesh flood alerts’): estimated savings per household were US$400-500 (about one year’s income).

Regional forecasting requires calibration
with local data, such as geographic contours, 
and so is beyond the remit of the global 
weather centres. Intermediaries — research 
groups, universities or companies — can 
form a bridge between global-forecast pro- 
viders and user communities. Partnerships 
are springing up, but more are needed.

Bangladesh offers a success story that could 
be emulated elsewhere. Global forecasts pro- 
duced in Europe are sent to the United States 
and turned into flood forecasts that, within six 
hours, can be integrated into Bangladesh’s
disaster-management protocol by local experts.

The need for a rapid forecasting and warn- 
ing system in Bangladesh became apparent 
following the 1998 floods. The ECMWF, the
Bangladeshi government and my research 
group therefore developed a 1-10-day 
flood-forecasting system and created the 
Climate Forecast Applications Network 
(CFAN) to distribute it. This system was 
first used experimentally in 2004 and became 
operational in 2007. The basic science was 
developed with support from the US National 
Science Foundation, and implementation by 
the CFAN was funded by the US Agency for 
International Development (USAID) and the 
humanitarian agency CARE. 

The CFAN has produced daily forecasts 
of the Brahmaputra and Ganges flows since 
2004, sending them to the Bangladesh Flood
Forecasting and Warning Centre. If the probability
of flooding exceeds 80%, warnings
are issued to government offices
across Bangladesh. 
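
The warning rule can be sketched in a few lines of code. The article does not say how the flood probability is calculated, so this sketch assumes it is simply the fraction of ensemble members whose forecast river flow exceeds a danger level. The function names, the danger level and the flow values are hypothetical and purely illustrative.

```python
from typing import Sequence


def flood_probability(member_flows: Sequence[float], danger_level: float) -> float:
    """Return the fraction of ensemble members whose flow exceeds danger_level.

    Assumption: every ensemble member is treated as equally likely.
    """
    if not member_flows:
        raise ValueError("ensemble must contain at least one member")
    exceeding = sum(1 for flow in member_flows if flow > danger_level)
    return exceeding / len(member_flows)


def should_warn(member_flows: Sequence[float], danger_level: float,
                threshold: float = 0.8) -> bool:
    """Issue a warning only if the flood probability exceeds the threshold (80%)."""
    return flood_probability(member_flows, danger_level) > threshold


# Hypothetical 51-member ensemble of forecast river flows (cubic metres per second).
ensemble = [52_000.0] * 45 + [38_000.0] * 6
probability = flood_probability(ensemble, danger_level=50_000.0)
warn = should_warn(ensemble, danger_level=50_000.0)
```

Here 45 of the 51 hypothetical members exceed the danger level, so the estimated probability is about 88% and a warning would be issued. An operational system would also need to calibrate the ensemble against local river data.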

Planning and training are essential 
for effective use of the forecasts. Before 
the 2007 flood season, village and com- 
munity leaders in six administrative 
unions of Bangladesh were trained to 
interpret the data and to take action if 
flooding was likely. Local leaders could 
tell farmers to harvest crops, shelter 
animals, store clean water and secure 
food, household and farming effects. 

Bangladesh experienced three major 
floods in 2007 and 2008. Each was fore- 
cast successfully ten days in advance 
and mitigation steps were taken (see
‘Bangladesh flood alerts’). On the basis
of a World Bank report, one analysis
concluded that about $40 was saved
for every dollar invested in the regional
forecasting and warning system. Savings
at the village level were measured
in units of annual income.

In 2009, to boost regional capacity- 
building, the CFAN handed over its 
flood-forecast modules to the Bangla- 
desh Flood Forecasting and Warning 
Centre. When the large volume of data 
proved too difficult for the centre to 
handle, the responsibility shifted to an 
international non-government entity, 
the Regional Integrated and Multi-Hazard 
Early Warning System (RIMES). 

Funded partly by contributions from mem- 
ber states, RIMES works with governments 
across south and east Asia to incorporate 
regional forecasts into national disaster- 
mitigation programmes and provides warn- 
ings for a range of natural hazards, including 
earthquakes, tsunamis and extreme
weather. RIMES is innovative but its 
funding is limited, making it difficult to 
maintain the cadre of scientists necessary 
to tackle specific problems. Inadequate 
funding also hinders the essential updat- 
ing of forecast modules as satellite and 
global forecast systems change. 


GLOBAL PARTNERSHIPS 

Partnerships that bridge the gap between 
the global forecasters and the user com- 
munity need to be established in other 
regions to address a range of weather 
hazards. The plan and type of group that 
forms the bridge will depend on the type 
of hazard being addressed. But the aim of 
each team is the same: to produce hazard- 
forecast modules based on the global fore- 
casts and to use them to provide warnings 
for the region. The team will also be 
responsible for updating the modules as 
systems and technologies change. 

Such partnerships can be aided by 
sustained funding from intergovern- 
mental organizations, such as the United 
Nations, the World Bank and USAID. 
My research group estimates that the 
cost of extended 10-15-day forecasts for 
south and east Asia for a wide range of 
hydrometeorological hazards (includ- 
ing slow-rise monsoon floods, droughts 
and tropical cyclones) is relatively small: 
perhaps $2 million to $3 million per year. 

Asia and Africa stand on the thresh- 
old of great economic advancement and 
can build resilience through the effective 
use of longer-range weather forecasts.
Faced with possible climate change, soci- 
eties that learn to cope with and mitigate 
hazards now will be most adept at dealing 
with more frequent and intense hazards 
in the future.


Peter J. Webster is a professor of Earth 
and Atmospheric Sciences at the Georgia 
Institute of Technology, Atlanta, USA. 
e-mail: pjw@eas.gatech.edu 
The correspondence of Francis Collins (left) and John Sulston illuminates a vital part of science history.


Science today, 
history tomorrow 


We must preserve the interactions of contemporary 
researchers for future scholars, urges Georgina Ferry. 


The year 1998 was crucial for the
Human Genome Project (HGP), an
international collaboration launched
eight years before to sequence the complete
human genome. Spurred by the launch
of a privately financed sequencing bid by
Craig Venter, the HGP’s leaders decided
to accelerate their own efforts. Some of the
proposed changes caused friction: the
HGP was long planned and carefully
executed. In October that year, John Sulston, 
then director of the Wellcome Trust Sanger 
Institute near Cambridge, UK, felt so 
beleaguered that he sent a strongly worded 
e-mail to Francis Collins, then director of 
the US National Human Genome Research 
Institute (NHGRI) in Bethesda, Maryland. 
The subject line? ‘Friendly fire’.
For anyone interested in the history
of the HGP, this e-mail is a key document 
(and one that was later acknowledged as an 
‘emotional outburst’ by its author, who now 
leads the Institute for Science, Ethics and 
Innovation at the University of Manchester, 
UK). Was it a catalyst for improved com- 
munication between the main players? 
Who helped to resolve the conflict? To what 
extent were the directors of the five leading 
sequencing centres competing as well as 
collaborating? The content of the 
e-mail traffic between and within the 
sequencing teams offers a potentially rich 
seam of enquiry. 

As the co-author of Sulston’s account of 
the HGP (The Common Thread; Joseph 
Henry Press, 2002), I saw this e-mail and 
many others, but they are not generally avail- 
able. That may change, thanks to an inter- 
national archiving programme now under 
way. The Wellcome Library in London is 
funding an archivist, Jenny Shaw, to sur- 
vey the documentary record relating to the 
HGP and earlier mapping and sequencing 
activities in the United Kingdom between 
1977 and 2004. Ludmila Pollock, executive 
director of the library and archives at Cold 
Spring Harbor Laboratory (CSHL) in New 
York, is conducting a parallel exercise in the 
United States. The first objective is to cata- 
logue these materials. A longer-term aim — 
which will depend heavily on funding and 
the willingness of the scientific community 
to cooperate — is to secure them in reput- 
able repositories and make them available 
to scholars. 

The programme throws into relief how 
fragile a trace modern science is leaving in the 
historical record. As a scientific biographer, I 
have spent hours happily immersed in piles 
of yellowing papers that are carefully stored 
in archive boxes and guarded by watchful 
custodians in academic libraries. Future bio- 
graphers will not be as lucky. Today’s scientists 
underestimate the historical importance of 
anything other than their published papers; 
they communicate almost entirely electroni- 
cally; and funding for archival preservation 
is increasingly uncertain. If we care about 
documenting the astonishing discoveries of 
the twentieth and twenty-first centuries, we 
must act now. 


GOOD PRACTICE 

Why are archiving exercises such as the HGP’s 
necessary? We are fortunate that Collins, who 
is now director of the US National Institutes 
of Health (NIH), kept his papers, and even 
more fortunate that his successor at the 
NHGRI, Eric Green, is employing an archivist
to digitize them. The NHGRI is only
now developing an archiving policy. Previously,
says Green, “records administration
at the institute meant throwing things away
that were ‘no longer needed’”. But this material
represents only a fraction of the documents
that record the HGP’s history. The genome-sequencing
story began before the NIH took a
leading role, and it involved many institutions
and individuals, inside and outside the
United States, for which the NIH had
no responsibility.

There remains an urgent need to
reconstruct the ‘paper trails’ (often largely
electronic) that eventually led to the
complete, publicly available sequence.
These include a vast amount of e-mail
correspondence, as well as informal literature
such as The Worm Breeder’s Gazette (now exclusively
online) and the software that Sulston and
his colleague Richard Durbin wrote to
manage their genome-mapping data in the
mid-1980s.

Particle-accelerator building at CERN, which launched its archiving programme in 1979.

Comprehensive record-keeping is easier 
within a single institution. CERN, the 
particle-physics laboratory near Geneva, 
Switzerland, showed a commendable sense 
of its own historical significance when it com- 
missioned a regularly updated biography in 
1979, 25 years after it opened. Divisional 
records officers at the facility now ensure 
a smooth pipeline through to the central 
archives. The archivists encourage senior 
scientists to have their filing systems 
appraised for historical interest, and they 
have a strategy for selecting and preserving 
e-mails. As the birthplace of the World Wide 
Web, CERN is also working to archive its own 
web pages. 

Scientific archives typically consist of 
institutional records such as CERN’s, and 
the personal papers of distinguished (and, 
usually, dead) scientists. When I wrote a 
biography of the Nobel prizewinner Dorothy
Hodgkin (1910-94), I relied heavily on her
collected papers, housed in the Bodleian 
Library at the University of Oxford, UK. 
Among them I found, for example, a letter 
dated June 1939, in which Hodgkin’s con- 
temporary Dorothy Wrinch attempted — 
unsuccessfully — to win sisterly solidarity 
for her erroneous theories on protein struc- 
ture (“Our chromosome count of course 
does not tend to weaken the desire of others 
to attack us,” she wrote). 


FLEETING TWEETS

But it is increasingly difficult for such
repositories to capture the full picture. 
Modern science involves experts in numer- 
ous disciplines, collaborating within and 
between institutions. Few will have reached 
the age or eminence at which scientists might 
typically think of depositing their papers. The 
lab notebooks kept by technicians and junior 
research staff are also important. Most of the 
contributors to the HGP are still very much 
alive, so archivists will need permission to 
scan the contents of their e-mail accounts on 
personal hard drives or institutional servers.

The ubiquity of ‘born-digital’ material,
such as e-mails and web pages — and now 
tweets, Facebook comments, forum posts and 
instant messages (IMs), which many scientists 
use to share expertise, announce new publi- 
cations or engage in policy debates — makes 
the quest to preserve even more urgent. The 
hardware and software used to generate and 
store digital material can become obsolete 
or be deleted or otherwise destroyed with 
ease, and it is not always clear who owns it. 
The British Library in London is tackling 
these issues with its Digital Lives project; it 
also leads the UK Web Archive collabora- 
tion, which stores pages of scholarly interest. 
Several online tools for archiving tweets by 
hashtag or username are available.

Technology is the easy bit, however. Much
more difficult is convincing people that their 
off-the-cuff tweets and IMs are the stuff of 
history. E-mail users do not always file their 
personal and professional messages sepa- 
rately, and many are understandably wary 
of making any of their correspondence 
public. Since the ‘Climategate’ affair in 2009 
— when e-mail servers at the University 
of East Anglia, UK, were hacked and their 
contents publicized — scientists are all 
too aware that an unguarded remark to a 
colleague can snowball into a cause célèbre.

But archivists have long experience in
handling questions about confidentiality and 
access. The University of Cambridge library 
holds six boxes of love letters that belonged to 
the UK crystallographer J. D. Bernal, which 
are not to be opened until 2021 — 50 years 
after his death. I would trust any of the archivists
I have worked with to respect my wishes.

“Let’s not wait until memories have
faded and papers been discarded ... before
deciding to save our heritage,” exhorted
Nobel prizewinner Sydney Brenner in 2007, 
in a letter announcing the donation of his 
papers to the CSHL archive (S. Brenner and 
R. J. Roberts Nature 446, 725; 2007). Others 
may ask why historians have any right to see
material not in the published record. Sulston 
himself, who is supportive of the archiving 
project, admits that “scientists can be quite 
conflicted about historical research, because 
it’s important to forget things and move on.” 

The history of science is much more than 
a chronology of scientific facts and theories. 
Access to informal sources is essential for 
understanding the personal, political and 
social context in which research takes place. 
Take the story of
the discovery of the double-helical structure
of DNA, told by historian Robert Olby
in The Path to the Double Helix (Dover
Publications, 1974), by science writer Horace
Freeland Judson in The Eighth Day of
Creation (Simon and Schuster, 1979) and
revisited more recently in the biographies of
Rosalind Franklin and Francis Crick. 

All these authors had access to 
correspondence and notebooks, without 
which we would have been none the wiser 
about the complex web of personal commun- 
ication (and miscommunication) that led 
Crick and James Watson to their discovery. 
And when more turned up in 2010, the plot 
thickened again. Two decades of Crick’s 
correspondence turned out to be mixed up 
with Brenner’s papers (the two shared an 
office from 1956 to 1977), including a 1953 
letter in which Crick admitted to a colleague
that if he had seen Rosalind Franklin’s
photo of the ‘B’ form of DNA, he would have
been “considerably worried” (A. Gann and 
J. Witkowski Nature 467, 519-524; 2010). 


SAVE THE SAVERS

Fifty years hence, do we want the story of
the sequencing of the human genome — an 
enterprise that forms the bedrock of much of 
twenty-first-century biomedicine — to rest 
on scientific papers and institutional reports, 
leavened only with breathless news reporting 
from the popular media? Surely not. We want 
our scholarly successors to be able to follow 
the twists and turns of the scientific, political 
and personal pathways that intersected as the 
human genome’s 3 billion base pairs winked
across the screens of the sequencing centres.

So, what to do? First, institutions and
individuals need to have the confidence 
to place their records in the hands of pro- 
fessional archivists working in reputable 
repositories. Second, funding bodies need 
to be convinced that efforts to acquire, store,
catalogue and disseminate such records are 
essential to preserving our scientific heritage. 

In the United Kingdom, the funding 
situation is grim. The UK National 
Cataloguing Unit for the Archives of 
Contemporary Scientists had a 36-year 
track record of seeking out and cataloguing 
the papers of British scientists before losing 
its core funding and closing in 2009. A 
successor organization, the Centre for 
Scientific Archives (CSA), was established at 
a Science Museum store in Wroughton, UK, 
the same year. The CSA is run by a voluntary
board chaired by Anne Barrett, an archivist 
at Imperial College London, and it receives 
some in-kind help from bodies such as the 
Royal Society and the National Archives. 

The CSA has no core funding. At present, 
it engages freelance archivists to catalogue 
half a dozen collections of personal papers 
(including those of physicists Joseph Rotblat 
and Gareth Roberts) that came with their 
own funding. Meanwhile, the post of curator 
of the history of science at the British Library 
(which holds the papers of luminaries such 
as Charles Babbage and Alexander Fleming) 
has been frozen since it was vacated in 
February 2011. 

Admittedly, history is not a priority at a 
time of recession. But it is baffling that scien- 
tific heritage has attracted so little support in 
the United Kingdom, especially as the country 
managed to raise almost £8 million (US$12.9 
million) to keep an 1868 painting (Portrait 
of Mademoiselle Claus) by the French artist 
Edouard Manet from being moved overseas 
in August 2012. The project to catalogue the 
HGP is benefiting from the Wellcome Trust’s 
long-standing commitment to the history of 
medical research, and in the United States, the 
CSHL has found early support from private 
donors and foundations. 

But much more funding is needed. And 
it is urgent that we carry out similar scoping 
studies for other areas of science, to identify 
what should be saved before it disappears. 

The sequencing of the human genome did 
not answer every question in biology, but it 
provided a publicly accessible resource that 
can be used for all time by inventive scientists 
with new questions. A documentary archive 
of the project will provide just such a resource 
for historians. If we want future generations to 
understand how science and society interact, 
libraries, research institutions and individual 
researchers must work together to preserve 
the documentary heritage of contemporary 
science.


Georgina Ferry is a science writer and 
author and a member of the advisory 
committee of the Wellcome Library’s
Human Genome Archive Project, 
Oxford OX2 6JE, UK. 

e-mail: mgf@georginaferry.com 
The opera Laika the Spacedog will tell the story of the first animal to be sent into orbit.


| LISTINGS | 
Hot tickets for 2013
in science and art 


This is your year if you want to rub shoulders with 
canine cosmonaut Laika or astronomer Galileo 
Galilei; travel through time, oscillate, get lost in a fog 
sculpture or ponder extinction; or listen to sound 
projected through liquid nitrogen. Jascha Hoffman
offers his top tips on science’s cultural calendar.


Laika the Spacedog 
Science Museum, London 
22-25 January (then on tour) 


Canine cosmonaut Laika was the first animal 
launched into orbit, in 1957. This “highly 
interactive” opera about the Soviet pooch
is intended to galvanize science-hungry
schoolchildren. Physical forces, the Solar
System, rocket science and health are
explored through singing, animated film,
puppetry and live action in this English
Touring Opera production, with music by
Russell Hepplewhite and lyrics by Tim
Yealland. There is even a chance to play a 
theremin. A happy ending may not be in the 
offing, however: a victim of the space race 
between the Soviets and Americans, Laika 
died in orbit. 


A Life of Galileo 
Swan Theatre, Stratford-upon-Avon, UK 
31 January to 30 March 


Bertolt Brecht’s masterly evocation of 
science at bay, A Life of Galileo, gets the 
Royal Shakespeare Company treatment
in this new production. Current writer-in-
residence Mark Ravenhill has penned a
fresh translation of the seventeenth-century
physicist-philosopher’s grim battles with the
Church over heliocentrism. Ian McDiarmid
takes the title role; Roxana Silbert directs. 
Ravenhill, whose debut was Shopping and 
Fucking (1996), is unabashedly political, so 
the Marxist tones of Brecht’s work may be 
writ large. 


OSCILLATOR 
Science Gallery, Dublin 
7 February to 14 April 


Circadian rhythms, 
resonating 
bridges, erupting 
geysers and 
volatile markets 
— Dublin’s dynamic Science Gallery kicks 
off its 2013 season with an exploration of 
oscillation, including cultural fads that cycle 
in and out of vogue. Curator is media artist 
Douglas Repetto, founder of the worldwide 
network Dorkbot, which brings together 
artists, designers, scientists and innovators 
working in electronic art. Dizzy visitors may 
realize that everything comes in waves — 
even shows such as this one, which makes 
way for exhibitions on risk, illusion and 
synthetic biology later in the year. 


Extinction: Not the End of the World? 
Natural History Museum, London 
8 February to 8 September 


The main hall of London’s iconic Natural 
History Museum is dominated by the 
26-metre-long cast of a fossilized Diplodocus 
skeleton. Hundreds of the taxidermied 
beasts that throng its rooms, imprisoned in 
cabinets, are gone for good outside 
its walls. So the museum is 
the ideal spot to explore 
extinction, the subject 
of this major show. It will 
pull together images and 
specimens of species already 
lost, as well as those teetering on 
the brink, to probe issues around 
how human exploitation and other 
factors have depleted the variety of life. 


The Medici: People, Power and Passion 
The Reiss-Engelhorn Museum, Mannheim, Germany 
17 February to 28 July 


Some of the most sumptuous art of the 
Italian Renaissance was bankrolled by the 
Medici — and the dynasty also supported 
science for centuries. Members of the 
illustrious clan backed the likes of Galileo 
Galilei and contributed to natural history, 
medicine and applied mathematics. When 
the last of the ducal line, Anna Maria Luisa de 
Medici, died in 1743, she bequeathed to the 
city of Florence the family treasures — and, 
unwittingly, a marvel of a
different sort to posterity. This 
exhibition on the cultural history of the 
Medici will showcase the scientific analysis 
of Anna Maria Luisa’s corpse, exhumed for 
the purpose in late 2012. Those involved in 
the project — which will feature computed 
tomography animations and skeletal 
remains — include the Medical Faculty of the 
University of Florence and Reiss-Engelhorn’s 
own German-Mummy Project. 


Time and Navigation: The Untold Story of Getting from Here to There
Smithsonian National Air and Space Museum, Washington DC
Opens 29 March


Timekeeping has been essential to 
navigation since the eighteenth century, 
when British carpenter John Harrison 
invented a clock accurate enough for 
sailors to determine their longitude using
deviations in the positions of celestial bodies.
And it is still a necessity in the age of the
Global Positioning System; atomic clocks in
orbiting satellites must be synced to within
nanoseconds to allow mobile phones to
triangulate their position. This exhibition,
a collaboration between the Smithsonian
Institution’s National Air and Space Museum
and National Museum of American History,
takes in the history of navigation with an
emphasis on timekeeping. It features the
clock Charles Lindbergh used to navigate
the Spirit of St Louis across the Atlantic
in 1927; a spacecraft whose navigation
system propelled it to Mercury in the 1970s
using the gravity field of Venus; and a self-
navigating Volkswagen named Stanley.

A copy of Mariner 10, which visited Mercury in 1974.


Margaret Guthman Musical Instrument Competition
Georgia Institute of Technology, Atlanta
11-12 April

This competition at Atlanta’s public-research 
hotspot offers US$10,000 in prizes for 
innovations in “musicality, design and 
engineering”. Winning entries in years past 
have included a device that turns everyday 
objects such as a whisk into musical 
instruments and a vintage slot machine by 
Berlin-based artist Christian Graupner and 
partners that allows players to remix musical 
video clips into an “audio-visual triptych”. 
This year’s judges (among them Berkeley 
computer musicologist David Wessel and 
performance artist Laurie Anderson) will 
face hard choices, if last year’s inventions are 
anything to go by. Those included the textile-
based Audio Skin and the Resistor JelTone,
a part-edible toy piano by NYC Resistor, the
New York-based hacker collective. 


Exploratorium 
New building at Pier 15, San Francisco, California 
Opens 17 April 


When it was opened in 1969 by physicist Frank Oppenheimer, who worked on the 
Manhattan Project with his better-known brother, Robert, the Exploratorium was one of the 
first museums to favour an interactive science education. Now, as it relocates to the city’s 
waterfront, the Exploratorium is expanding its artist-in-residency programme through
its new Center for Art & Inquiry. The museum will unveil site-specific works, including an
Aeolian, or wind, harp by Doug Hollis that plays the breezes from the bay between Piers 15 
and 17, and a ‘fog sculpture’ by Japanese artist Fujiko Nakaya.


Human Genome exhibition
Smithsonian National Museum of Natural History, 
Washington DC 
June 2013 to June 2014 


In a year that sees both the 60th anniversary 
of Francis Crick and James Watson’s 
elucidation of DNA’s structure and the
10th anniversary of the human genome’s
complete decoding, the Smithsonian
Institution is pulling out all the stops. For
this exhibition, its natural history museum 
joins forces with the National Human 
Genome Research Institute in Bethesda, 
Maryland, to explore what the genome is, 
what it tells us and how this information 
could revolutionize health care and our 
understanding of our place in the world. After 
its time on the National Mall, the show will 
travel around North America. 


Ars Electronica Festival 
Linz, Austria 
Early September 


Founded in Linz in 1979, Ars Electronica 
has long been the festival at the cutting 
edge of electronic art. Over the years it has 
expanded from galleries and conference 
halls to tunnels, factories and even a 
nearby monastery in its effort to bring 
technology-driven art to the public. This 
year, the festival teams up with the budding 
artist-in-residence programme at particle- 
physics powerhouse CERN, near Geneva in 
Switzerland, to premiere a new installation 
by US sound artist Bill Fontana. Mimicking 
the protocol of a scientific experiment, 
Fontana will project short bursts of sound 
through various substances — air, water, 
liquid nitrogen and metal, for example — to 
compare their speed of conduction and 
sonic properties. 


Large Hadron Collider
Science Museum, London
8 November 2013 to 30 April 2014

After last year’s thrilling announcement
of the discovery of a Higgs boson, it was
inevitable that the Large Hadron Collider, 
built and operated by some 10,000 scientists 
and engineers, would earn its own exhibition. 
When the travelling show debuts at London’s 
Science Museum, visitors will see the small 
bottle of hydrogen gas that fed protons into 
the 27-kilometre accelerator at CERN, the 
world’s largest particle-physics laboratory, 
near Geneva in Switzerland. Also on display 
will be a giant superconducting magnet used 
to bend the particle beams, which approach 
the speed of light, and historical devices such 
as the apparatus that led to the discovery of 
the electron in 1897. 


Jascha Hoffman is a journalist based 
in San Francisco, California. Additional 
reporting by Daniel Cressey and Alison 
Abbott. 
INNOVATION 


The Cylindrical Colony: one of several designs for a space settlement mooted by Gerard O’Neill. 


Limits be damned 


Cyrus Mody applauds an examination of the twentieth- 
century scientists who dreamed of breaking the bounds. 


To the best of our knowledge, human 
life is constrained by natural limits: 
we do not live forever, we cannot 
transport ourselves or transmit information 
faster than the speed of light, and there is 
a finite supply of fossil fuels. Debates about 
such limits have shaped, and been shaped 
by, scientific and technological knowledge 
for centuries. Even faulty predictions about 
limits have made important contributions. 
Thomas Malthus’ pessimism, for instance, 
prepared the ground for Darwin’s theory of 
natural selection, and the overly optimistic 
vision of Lewis Strauss, former chairman 
of the US Atomic Energy Commission, 
of “energy too cheap to meter” facilitated 
decades of nuclear-power research and 
development. 

In The Visioneers, science historian 
Patrick McCray of the University of Cali- 
fornia, Santa Barbara, argues that the 
resource-scarcity debates of the 1970s 
inspired a generation of visionary scientists 
and engineers. This influential crew had big 
dreams about overcoming all kinds of limits; 
occasionally built working models to dem- 
onstrate progress towards their dreams; and 
passionately assembled coalitions to make 
those dreams a reality. 
McCray focuses 
on Gerard K. O’Neill, 
the Princeton physi- 
cist and designer of 
space colonies, and 
on his protégé, K. Eric 
Drexler, the ‘speculative 
engineer’ trained 
at the Massachusetts 
Institute of Technol- 
ogy (MIT) in Cam- 
bridge who helped to 
put nanotechnology 
on political agendas in 
the early 1990s. Along 
the way, McCray 
introduces a large 
and colourful cast of 
others who, over four 
decades, promoted 
technological progress 
as the way to overcome every limit. 

O’Neill’s ideas reached a mass audience 
in part through the L5 Society founded in 
1975 by Keith and Carolyn Henson. These 
livestock farmers and Tolkien enthusiasts 
from Arizona later drifted into advocacy 
for the Strategic Defense Initiative and cryonic 
life extension, a proposed technology 
by which all or part of a human body would 
be frozen at death in the hope that it could 
be re-animated later. Drexler also imagined 
that cryonic immortality could be 
facilitated by programmable ‘molecular 
assemblers’ — nanometre-scale robots, or 
nanobots — repairing the tissues of corpses 
frozen at death. 


The Visioneers: How a Group of Elite Scientists Pursued Space Colonies, Nanotechnologies, and a Limitless Future 
W. PATRICK MCCRAY 
Princeton University Press: 2012. 328 pp. £19.95, $29.95 

Pillars of the California counterculture — 
such as the psychologist and LSD advocate 
Timothy Leary, and Stewart Brand, founder 
of the Whole Earth Catalog — also took up 
the visions of O’Neill and Drexler, working 
them into manifestos on transhumanism 
and the ‘electronic frontier’. Brand served 
on the board of the Foresight Institute, set 
up in 1986 by Drexler, and made Drexler’s 
molecular assemblers a centrepiece of the 
future scenarios that his Global Business 
Network sold to enthralled chief executives. 

McCray documents how cryonics and rad- 
ical life extension, space colonies, molecular 
nanotechnology and exotic sources of energy 
(such as solar-power satellites and zero-point 
energy) were widely popularized, alongside 
unsceptical articles about paranormal phe- 
nomena, by the pornographers Bob Guc- 
cione and Kathryn Keeton in their glossy 
monthly magazine Omni. Indeed, McCray 
argues that the audience that Omni catered 
to — young and male, with a taste for luxury 
goods, high-tech gadgets, libertarian politics 
and libertine excesses — strongly resembled 
the visioneers and many of their followers. 

One thread ran through all of this: the 1972 
blockbuster The Limits to Growth (Universe), 
by global think-tank the Club of Rome. 
This book goaded O'Neill and Drexler, says 
McCray, into sketching their plans for a lim- 
itless future. The Limits to Growth — along 
with public intellectuals such as the biolo- 
gist Paul Ehrlich and the ecologist Garrett 
Hardin, plus fictional films such as Soylent 
Green, Logan’s Run and Silent Running — 
popularized the idea that resource scarcity 
and a growing population would combine 
to create shortages of economically crucial 
materials. That message took root around the 
world in the 1970s, particularly (if temporarily) 
in a United States beset by ‘stagflation’, oil 
shortages and environmental crises such as 
the Santa Barbara oil spill of 1969. 

However, the original computer models 
on which The Limits to Growth was based, 
developed by veterans of Jay Forrester’s 
systems-dynamics group at MIT, failed to 
account adequately for the role of techno- 
logical innovation in ameliorating resource 
scarcity, at least over the near term. Although 
the models were later refined, the 1972 version 
provoked a storm 
on how the hippies people, particularly 
were infused with the optimism of the US 
space programme, responded to talk of 
scarcity with a visceral aversion. These 
teens and twenty-somethings latched on 
to O’Neill’s visions of suburbs in space 
piping abundant solar power and lunar 
regolith back to Earth. O’Neill himself 
was ambivalent about their support, and 
when his star faded they moved on to 
form or follow other high-tech enthusi- 
ast movements, each of which took The 
Limits to Growth as its foil. 

McCray’s book is especially convincing 
in following the various movements that 
arose in reaction to the Club of Rome’s 
1972 book. At present, we face genuinely 
alarming limits to growth. Our ability to 
comprehend and act on such constraints 
— particularly with respect to climate 
change and alternative energy — is still 
distorted by the infelicities in the first edi- 
tion of The Limits to Growth and the fero- 
cious reaction to its conclusions. Some 
visioneering ideas for overcoming limits 
to economic growth have contributed to 
inaction on climate change by promis- 
ing an appealing but impossibly easy, 
sacrifice-free, small-government path to 
a limitless future. These have distracted 
attention from politically difficult, less 
technology-intensive solutions. 

McCray’s argument that visioneers 
play an important part in the “techno- 
logical ecosystem” is also compelling, but 
asymmetrically deployed. For one thing, 
as the book’s subtitle implies, only those 
who propose a limitless future get to be 
visioneers; technical experts who popularize 
visions of a future that is constrained by 
scarcity (Forrester or the biologist Barry 
Commoner, for example) apparently do 
not count. McCray also sometimes treats 
his visioneers less critically than their 
foils. He describes The Limits to Growth 
as “refuted” by experts, but treats equally 
damning arguments against the visions 
of O’Neill and Drexler in a ‘he-said—she- 
said’ fashion. For instance, Nobel Laureate 
Richard Smalley’s contention that Drex- 
ler’s molecular gears and conveyor belts 
obey an impossible chemistry is dismissed 
as “Drexler and Smalley largely talk[ing] 
past one another”. 

Yet McCray is correct that visioneers 
influence, and are influenced by, an 
ecosystem of philanthropists, politi- 
cians, funding agencies, entrepreneurs, 
undergraduates, scientists and others. 
That group spurs technological innova- 
tion, crafts science policy, and shapes and 
shares widely held visions of the future. 


Cyrus C. M. Mody is an assistant 
professor in the History Department, Rice 
University, Houston, Texas. 

e-mail: cyrus.mody@rice.edu 
The cosmological you 


Birger Schmitz weighs up an exploration of how the 
Universe permeates us. 


Neil Shubin’s masterpiece Your Inner 
Fish changed the way I see myself. 
Using evidence of the first fish- 
amphibians that left the oceans for land 
375 million years ago, Shubin described 
with stunning clarity how every aspect of our 
anatomy goes back to our distant ancestors. 

Now, in his follow-up The Universe 
Within, he takes the discussion a step fur- 
ther: how the Universe formed, our place in 
the Solar System and the intertwined evolu- 
tion of our planet and life. He shows that all 
this is built into us as physical beings. 

Inside us, for instance, are atoms that 
formed in exploding stars. The movements of 
heavenly bodies are inherent in our percep- 
tion of time and in biological clocks. Physical 
parameters such as gravity determined our 
shapes and sizes. Had Jupiter formed closer 
to the Sun, we would have turned out short 
and squat; farther out and we would have 
been slender and elongated. This is because 
Jupiter formed before the inner rocky plan- 
ets, and its position relative to the Sun deter- 
mined Earth’s size and gravity. 

Shubin starts with the formation of the 
Universe 13.7 billion years ago, segueing into 
that of the Solar System 4.6 billion years ago. 
Much later, about 200 million years ago, when 
the supercontinent Pangaea broke up, the 
continents and ocean basins we know today 
began to form. This was accompanied by 
the rapid evolution of more complex life 
forms — dinosaurs, mammals and birds. 

Shubin suggests a rather original connec- 
tion between continental break-up and the 
evolution of such creatures: mud settling on 
the vast stretches of coastline created by the 
break-up of Pangaea buried biological mate- 
rial that would otherwise have decayed in 
water, using up oxygen. The result, Shubin 
says, was an increase in atmospheric oxygen, 
one of the key factors that allowed animals 
to conquer land. Mammals require a lot of 
oxygen to maintain their high-energy, warm- 
blooded lifestyle. Life on the low-oxygen Earth 
of 200 million years ago would have been like 
that today at 4,500 metres above sea level. 

Much of the second half of The Universe 
Within summarizes the history of how our 
geological view of Earth developed. It incorporates 
stories such as how the discovery of 
similar fossil organisms on distant continents 
led Alfred Wegener and others 
towards the idea of continental drift. We 
also meet William Smith, who invented 
stratigraphy, Louis Agassiz, who discovered 
ice ages, and geologist Bruce Heezen and 
oceanographic cartographer Marie Tharp, 
who were central to developing the theory 
of plate tectonics. 


The Universe Within: Discovering the Common History of Rocks, Planets, and People 
NEIL SHUBIN 
Pantheon: 2013. 240 pp. $25.95, £20 


© 2013 Macmillan Publishers Limited. All rights reserved 

Shubin is at his best when he deals with 
anatomy and biology, as in his discussion of 
the inventive geologist Michel Siffre. In 1962, 
Siffre spent two months living in a subter- 
ranean cave to gauge whether he could track 
time without any tools with which to meas- 
ure it. After two months, he was convinced 
that only 37 days had passed. This was in line 
with what we know about the role in ‘internal 
clocks’ of the pineal gland, which regulates 
the production of sleep-inducing melatonin 
depending on the available light. Shubin’s 
storytelling in such passages is gripping. 

The Universe Within is a charming and 
enjoyable read, but it does not reach the 
heights of Your Inner Fish. There is a familiar 
feel to some of the sections, and the book’s 
title raises expectations that are not really 
met. Where are the mysteries of the brain, 
the laws of thought and our consciousness? 
These, to me, are the most amazing aspects of 
the ‘universe within’. In my view, the popular 
astronomy writer Timothy Ferris has touched 
on these aspects of the relationship between 
the soul and the Universe in a more thought- 
provoking way in books such as The Mind's 
Sky and The Whole Shebang. 

And what if our view of the Universe 
continues to change as much as it did in the 
past century? From Shubin, one gets the 
impression that much is now solved. But 
the mystery of why we are here is perhaps 
greater than ever. Maybe, as the physicist 
Max Planck put it: “Science cannot solve 
the ultimate mystery of nature. And that is 
because, in the last analysis, we ourselves are 
part of nature and therefore part of the mystery 
that we are trying to solve.” 


Birger Schmitz is a professor of geology at 
Lund University in Sweden and the leader of 
the ASTROGEOBIOSPHERE project. 
e-mail: birger.schmitz@nuclear.lu.se 
Correspondence 


NIH funding: agency 
rebuts critique 


We disagree with Joshua 
Nicholson and John Ioannidis’ 
claim that the peer-review 
system of the US National 
Institutes of Health (NIH) works 
against genuinely innovative 
research, because we believe that 
their analysis is flawed (Nature 
492, 34-36; 2012). 

They use 1,000 or more 
citations as a proxy for identifying 
breakthrough discoveries. 
However, more than 60% of the 
158 highly cited articles they 
analyse cannot be categorized 
as innovative primary research: 
34% are reviews, reports, clinical 
guidelines or descriptions of 
resources; 18% are clinical trials 
of the type primarily funded by 
industry; and 11% fall outside 
the NIH mandate that research 
should have the potential to 
improve human health (for our 
analysis, see 
http://dpcpsi.nih.gov/opa/natcorr). 

Excluding these articles leaves 
58 of the original 158: of these, 
83% were funded by the NIH 
and 17% were funded by private 
industry. 

The NIH welcomes further 
innovation in biomedical 
research, as evidenced by 
funding mechanisms such as 
the NIH Director’s Pioneer, 
Transformative Research, 
Early Independence and New 
Innovator awards. 

George Santangelo Office of 
Portfolio Analysis, National 
Institutes of Health, Bethesda, 
Maryland, USA. 
george.santangelo@nih.gov 
David J. Lipman National Center 
for Biotechnology Information, 
National Institutes of Health, 
Bethesda, Maryland, USA. 


NIH funding: it does 
support innovators 


Joshua Nicholson and John 
Ioannidis conclude that “too 
many US authors of the most 
innovative and influential 
papers in the life sciences do 
not receive NIH funding”, 
on the basis of their analysis 
of 200 papers sampled from 
700 life-sciences papers with 
1,000 or more citations (Nature 
492, 34-36; 2012). However, my 
reanalysis of their data suggests 
that the US National Institutes 
of Health (NIH) has supported 
a substantial proportion of such 
contributions over the past 
12 years. 

For a random sample of 125 of 
the authors’ pool of 700 highly 
cited papers, I found that 63 were 
reviews and 17 fell outside the 
life sciences. Of the remaining 
45 original research papers, 
34 (that is, 75%) were supported 
by the NIH (source: NIH grants 
database and my own analysis); 
the other 11 papers did not 
receive NIH support for various 
reasons (for details, see 
go.nature.com/nywiid). 

Nicholson and Ioannidis 
further underestimate the NIH’s 
support for groundbreaking 
research by requiring both 
the first and last authors to be 
principal investigators on an NIH 
grant when, in fact, first authors 
are often graduate students. 
Steven L. Salzberg Johns 
Hopkins School of Medicine, 
Baltimore, Maryland, USA. 
salzberg@jhu.edu 


NIH funding: the 
critics respond 


In re-analysing our data (Nature 
492, 34-36; 2012), George 
Santangelo with David Lipman, 
and Steven Salzberg each 
exclude two-thirds of the top- 
cited US publications assigned 
by Scopus to life or health 
sciences. We consider that their 
analyses discard most health 
research that matters. 

Clinical trials are primary 
research that catalyses 
preventive or therapeutic 
innovation; reports and 
guidelines also decisively inform 
and radically transform health. 
Extremely highly cited reviews 
formulate pioneering concepts 
or synthesize influential work. 
Also, scholars reaching the top 
0.01% with papers of any type 
predict some further excellence. 
The authors’ analyses depend 
on grant acknowledgments 
(75-83% in both series) but 
these are a problematic metric 
because they represent grants 
that were awarded 10-15 years 
ago when the NIH budget was 
expanding and acceptance 
rates were highest. Also, any 
of several co-authors can 
acknowledge a funding source 
if they are under pressure to 
demonstrate grant-related 
productivity, even if that source 
is irrelevant. And let’s imagine 
a hypothetical funding system 
that forces all geniuses to quit 
science: 100% of papers could 
still acknowledge funding. 
Groundbreaking projects 
account for less than 1% of 
awarded grants. Students 
whose papers reach 1,000 or 
more citations should certainly 
become principal investigators: 
stars will abandon systems that 
stifle independence. 
John P. A. Ioannidis Stanford 
Prevention Research Center, 
Stanford, California, USA. 
jioannid@stanford.edu 
Joshua M. Nicholson Virginia 
Tech, Blacksburg, Virginia, USA. 


Science alone cannot 
shape sustainability 


We agree that science can and 
should inform the sustainable 
development goals agreed at the 
Rio+20 conference (G. Glaser 
Nature 491, 35; 2012). But basing 
policy decisions on science alone 
may be unproductive. 
Experience indicates that 
political receptiveness to 
scientific advice is essential for 
shaping policy. Climate-change 
policy-makers, for example, 
continually receive new technical 
and scientific input, but they still 
cannot agree how best to act on 
it to mitigate or adapt to climate 
change (J. Depledge Glob. 
Environ. Polit. 6, 1-22; 2006). 
Scientific evidence is rarely 
politically neutral or universally 
accepted; neither can it replace 
what is inherently a political 
process. 

Suraje Dessai, Stavros Afionis, 
James Van Alstine University of 
Leeds, UK. 

s.dessai@leeds.ac.uk 


Prize marks German 
journal centenary 


One hundred years ago 
today, the publisher Springer 
launched a German 
multidisciplinary journal 
named Naturwissenschaften 
(‘natural sciences’) at the behest 
of physicist and future editor 
Arnold Berliner. The journal was 
closely modelled on Nature. 
Nature maintained its support 
for the journal throughout the 
political upheavals in twentieth- 
century Europe. As a Jew, Berliner 
was forced to resign in 1935. 
Nature wrote: “We much regret 
to learn that on August 13 Dr. 
Arnold Berliner was removed 
from the editorship of Die 
Naturwissenschaften, obviously 
in consequence of non-Aryan 
policy” (Nature 136, 506; 1935). It 
also published a moving obituary 
when Berliner took his own life in 
1942 (Nature 150, 284; 1942). 
Naturwissenschaften — 
The Science of Nature is now 
published exclusively in English. 
During this anniversary year, the 
journal will provide open access 
online to its most highly rated 
100 articles, and will present 
the first Arnold Berliner Award 
for the best research article 
published in 2012, judged using 
Berliner’s original motivations 
of excellence, originality and 
interdisciplinarity. 
Joan Robinson Springer, 
Heidelberg, Germany. 
joan.robinson@springer.com 
Correspondence may be 
sent to correspondence@nature.com 
after consulting the guidelines at 
http://go.nature.com/cmchno. 
FORUM Palaeontology 


Fossils come in to land 


Fossils found in rocks of the Ediacaran period in Australia have been previously characterized as early marine organisms. 
But a report suggests that these rocks are fossilized soils. So did some of these Ediacaran organisms in fact live on land, like 
lichens? A palaeontologist and a geologist weigh up the evidence. SEE LETTER P.89 


THE PAPER IN BRIEF 

• The Ediacaran period, 635 million 
to 542 million years ago, immediately 
predates the Cambrian period, which saw an 
evolutionary explosion that led to all modern 
animal phyla. 

• Fossils from an Ediacaran geological 
formation in South Australia have been 
classified as invertebrates, protists or 
fungi, but they have invariably been 
thought of as being marine. 

• Retallack proposes, in a paper published 
in this issue (page 89)', that the Ediacara 
Member contains fossilized soils 
(palaeosols)*. 

• The presence of palaeosols suggests that 
some of the fossils within them may have 
been lichen-like organisms or microbial 
colonies that lived on land, rather than 
in the ocean. 


*This article and the paper under discussion’ were 
published online on 12 December 2012. 


Muddying 
the waters 


SHUHAI XIAO 


Fossils in the Ediacara Member in South Australia 
have been traditionally interpreted as 
representatives of ancestral marine organisms’. 
But, breaking away from this tradition and pursuing 
his own radical interpretation*”, Retallack’ 
now proposes that these fossiliferous beds 
are palaeosols and that some Ediacaran fossils 
are soil lichens or colonies of soil microbes. 
These propositions would represent a fundamental 
change in our picture of evolution, but 
they will probably face continuing scepticism 
because the evidence is unconvincing. 

Definitive identification of palaeosols in 
the Ediacara Member is a challenge, because 
this unit was deposited before land plants 
arose and it thus lacks features that are diagnostic 
of ancient soils, such as traces of plant 
roots. But Retallack cites a host of observations 
as evidence that the fossiliferous 
Ediacara Member originated from soil formation 
(pedogenesis): its reddish colour, its 
elemental and stable-isotopic geochemistry, 
patterns of surface disruption, and the presence 
of sand crystals of gypsum and nodules 
of carbonate. 

However, the evidence is ambiguous. For 
example, the reddish colour and depletion 
of certain elements in the Ediacara Member 
could be a result of weathering that occurred 
during the Cenozoic era (from 65 million 
years ago to the present), rather than result- 
ing from chemical weathering of the rocks 
through pedogenesis during the Ediacaran 
period’. Retallack counters that Cenozoic 
weathering would have produced continuously 
reddish strata, rather than the observed 
alternation of grey and red beds, but he fails 
to recognize that weathering colours 
can vary with lithological characteristics 
of the rocks (such as mineralogical composition 
and permeability). In addition, carbonate 
nodules and sand crystals of gypsum are 
common features of marine sediments, and 
the isotope signatures of carbonate nodules 
in the Ediacara Member can be accounted for 
by post-depositional alterations that do not 
involve pedogenic processes. 

Retallack further illustrates his argument 
for palaeosols with examples of large-scale 
disruption structures characteristic of soils 
(see Fig. 2b of the paper’), but such struc- 
tures are intriguingly similar to slumps or 
load structures resulting from subaqueous 
and post-depositional movement of sedi- 
ments. He also depicts small-scale disrup- 
tion structures, which he interprets as having 
been caused by millimetre-sized tubules that 
might be the fossilized remains of bacterial 
filaments, lichen rhizines (root-like filaments) 
or fungal hyphae (see Fig. 2c–g of the 
paper’). But I find this interpretation dubi- 
ous, because the tubules are too irregular to 
be confidently interpreted as being derived 
from microbes. 

In my opinion, this ambiguous evidence for 
pedogenesis is outweighed by compelling evi- 
dence for the marine (or at least subaqueous) 
origin of the Ediacara Member. For example, 
benthic Ediacaran organisms (those that lived 
on or within sediments), such as Cyclomedusa 
davidi and Dickinsonia costata, are preserved 
in situ on rippled bedding surfaces’ (Fig. 1). In 
addition, some Ediacaran fossils show hold- 
fasts (root-like structures) that were dragged in 
the same direction as the alignment of attached 
stalks®. These features could not have formed 
without the action of waves or currents. And 
detailed sedimentological analysis has revealed 
a suite of features characteristic of subaqueous 
deposition’, including ripple marks and 
current lineations. 

A palaeosol interpretation leads Retallack to 
reinterpret fossils in the Ediacara Member as 
the remains of soil lichens, microbial colonies, 
fungi, slime-mould trails or casts of needle ice 
(which forms in frozen soil). However, many 
Ediacaran species in Australia are also found 
worldwide in unambiguously marine forma- 
tions, such as black shales and limestones. 
Furthermore, the Ediacaran fossil Dickinsonia 
shows evidence of intermittent locomotion — 
but lichens do not move. The Ediacaran fossil 
Radulichnus, interpreted as casts of needle ice by 
Retallack’ but as traces of grazing organisms by 
others, has fanning sets of parallel scratches, an 
arrangement that cannot be explained by needle 
ice. And although Retallack proposes that Edi- 
acaran trace fossils are trails left by land-cruising 
slugs or aggregating slime moulds, these organ- 
isms could not have made the burrows that are 
clearly visible in the Ediacara Member. 

On a positive note, Retallack’s persistent 
pursuit of the idea of soils and lichens does 
motivate us to rethink the possibility that 
lichens, whether terrestrial or marine’, might 
have existed at this early time, and played a 
part in regulating the Ediacaran Earth before 
the rise of vascular plants. But we need 
clearer evidence before we should consider 
redrawing the timeline of life’s transition from 
sea to land. 


Figure 1 | Sand or soil. The picture shows Ediacaran fossils (Dickinsonia costata) on a rippled surface, 
found in the Ediacara Member in South Australia. Previous interpretations suggest that these fossils 
represent marine organisms that lived on the sea floor’, but Retallack’ proposes that they were land dwellers. 


Shuhai Xiao is in the Department of 
Geosciences, Virginia Tech, Blacksburg, 
Virginia 24061, USA. 

e-mail: xiao@vt.edu 


Not all at sea 


The last 700 million years of the Precambrian 
eon was apparently a time of major sea-level 
changes in which continental margins were 
alternately inundated and then left exposed to 
land-surface erosion. Ancient soils were being 
eroded into the sea®, and some should have been 
preserved as palaeosols similar to those in strata 
from more recent times. But so far, reports of 
late Precambrian examples are lacking. This 
may simply reflect the difficulty of recognizing 
soil strata without the root traces so obvious in 
much younger rocks. Other diagnostic features 
are more subtle and are generally recognized 
only by those experienced in studying palaeo- 
sols. Retallack is one such specialist, and he is 
well positioned to argue for the existence of soil- 
inhabiting Ediacaran organisms. 

Retallack has long pondered the nature of 
Ediacaran organisms, and controversially pro- 
posed that they were actually not animals but 
large lichens’. This heterodox interpretation 
would be greatly strengthened if he really has 
found examples that lived on land. Whether 


animal or lichen, the discovery would indicate 
that some organisms mastered the transition 
from marine to non-marine life much earlier 
than currently thought — or even support the 
possibility that the transition went the other 
way, to ultimately account for the Cambrian 
explosion in the sea’. 

So how strong are his arguments? Inter- 
preting ancient depositional environments 
is a tricky business, and a stratigraphic layer 
without telltale root fossils may be a palaeosol 
only in the eye of the beholder. For example, 
what Retallack suggests are geological rel- 
ics of soil deformation have been suggested 
by others to be water-escape features'”. His 
cogent arguments that the red colour repre- 
sents Precambrian weathering will be resisted 
by those familiar with the extensive red colour 
imparted to Australian rocks during mod- 
ern weathering. 
In the absence of 

precludes wash-ups on a shore, as 
he suggests. Furthermore, the isotope data for 
the carbonate nodules that Retallack claims 
represent subaerial exposure are also com- 
patible with coastal recharging of rainwater 
into subsurface aquifers known commonly to 
extend far offshore’. And finally, it is difficult 
to distinguish the sedimentary structures that 
the author interprets as sand deposited in a 
terrestrial valley from what could belong to 
a submarine canyon, as has been proposed’. 

As is usual in sedimentology, observations 
can be construed in alternative ways, and 
interpretations for these strata have histori- 
cally covered the gamut of geological possi- 
bilities — from lacustrine to lagoonal, coastal 
and open marine. It is appropriate that inter- 
pretations change or are superseded with the 
arrival of new observations, and that is why 
this publication is fascinating and timely and 
should be considered seriously. Although 
Retallack’s ideas are at odds with the accepted 
dogma, these do not need to be mutually 
exclusive. Why should it not be possible that 
some Ediacaran organisms lived on land, even 
if most of the other sites in which they have 
been found are interpreted as marine? There is 
still uncertainty regarding exactly what kind of 
organisms they were, so eliminating a possible 
habitat on the basis of whether or not they are 
animals is unwarranted. Retallack considers 
new data and observations and provides com- 
prehensive reasoning for each of his points. His 
considered case means that researchers scepti- 
cal of his interpretations will need to become 
experts in palaeosol characteristics to mount 
convincing counter-arguments. 

The search for late Precambrian biologi- 
cal evolution in the non-marine realm is an 
exciting new frontier, especially considering 
carbon-isotope data that probably indicate a 
late Precambrian greening of land surfaces”. 
Ediacaran organisms living in soils would be 
further evidence that land areas in this interval 
of Earth’s early history were not biologically 
barren surfaces as is commonly assumed. We 
were not there when all this happened and 
will never know for certain what actually 
happened when. So I say, until the forensic evidence 
for Ediacaran habitats becomes strongly 
compelling one way or the other, let multiple 
hypotheses thrive! 


L. Paul Knauth is in the School of Earth and 
Space Exploration, Arizona State University, 
Tempe, Arizona 85287-1404, USA. 

e-mail: knauth@asu.edu 
COMPUTATIONAL MATERIALS SCIENCE 


Soft heaps and 
clumpy crystals 


A detailed simulation of the packing behaviour of deformable particles settles the 
debate about whether soft matter can adopt an unconventional crystal structure 
at high densities — it can. The hunt is now on for a real-world example. 


FRANCESCO SCIORTINO & 
EMANUELA ZACCARELLI 


It’s the Rome metro at rush hour: passengers
are squeezed into close contact with one
another. But there is a physical limit beyond
which they cannot go, because their bodies
cannot occupy the same space. This common
experience has an equivalent at the molecular
scale, in what physicists call excluded volume:
strong repulsive forces, of quantum-mechanical
origin, that prevent atoms from occupying the
same space. Because of this phenomenon, dense
arrangements of atoms and molecules result in
solids that have lattice structures, in which
each particle excludes neighbours from its site
in the lattice. It is therefore surprising to
read Lenz and colleagues’ paper in Physical
Review Letters, which reports that the packing
of soft particles may result in unusual crystals
in which each lattice site is occupied not by a
single particle, but by clumps of particles.

Soft particles are nanometre- or
micrometre-sized macromolecules that have a
deformable shape. Focusing on polymers, for
example, one can envisage several different
soft particles of increasing complexity
(Fig. 1). These could be: linear chains; rings,
in which the ends of a polymer chain are
connected; stars, in which several polymer
chains are joined at a common centre; and
dendrimers, in which several stars are linked
together. For a fairly small energy cost, the
structures of soft particles can rearrange and
interpenetrate to cope with excluded-volume
constraints at the molecular scale. This allows
the centres of mass of different soft particles
to coincide, without any overlapping of the
monomers (Fig. 1).

To predict the collective behaviour of soft 
particles, scientists typically use one of two 
theoretical approaches. The first approach 
is to perform calculation-intensive, mon- 
omer-resolved simulations that provide an 
accurate description of a system's properties 
but are at the limit of today’s computational 
capabilities. The second is to use the tools of 
statistical mechanics to develop a simplified 
(coarse-grained) model of a soft particle and 
its interactions with other particles. Such sim- 
plifications are often so crude that each particle 
is represented as a single site at the particle's 
centre of mass, but they allow predictions to 
be derived more easily. 

Theoretical investigations using coarse- 
grained models suggest that, in the dense 
fluid state’, soft particles prefer to sit on top of 
each other, forming heaps or clumps. Indeed, 
beyond a certain density it becomes energeti- 
cally preferable for a particle to completely 
overlap with a few others, rather than be sub- 
jected to the cumulative repulsion of many 
more neighbours (Fig. 2). Similarly, passen- 
gers on the metro could consider piling up on 
top of one another rather than suffering from 
excessive squeezing. In fact, parents often do 
this with their children, by holding them in 
their arms! 
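
The energetic argument above can be illustrated with a minimal sketch. It assumes a bounded, purely repulsive pair potential of the generalized-exponential form u(r) = eps * exp(-(r/sigma)^4), a common coarse-grained stand-in for soft particles but not necessarily the model used in the work discussed here. The function names, the two-dimensional triangular lattice and the chosen densities are likewise illustrative assumptions. The code compares the energy per particle of a singly occupied lattice with that of a lattice of fully overlapping clumps at the same overall density.

```python
import math


def gem4(r, eps=1.0, sigma=1.0):
    """Bounded repulsive pair potential (assumed form, not taken from the article)."""
    return eps * math.exp(-((r / sigma) ** 4))


def lattice_sum(spacing, cutoff=4.0):
    """Sum u(|R|) over all non-zero sites R of a 2D triangular lattice.

    Sites farther than `cutoff` contribute essentially nothing, because the
    potential decays as exp(-r^4).
    """
    row_height = spacing * math.sqrt(3) / 2
    shells = int(math.ceil(cutoff / row_height))
    total = 0.0
    for i in range(-shells, shells + 1):
        for j in range(-shells, shells + 1):
            if i == 0 and j == 0:
                continue
            x = spacing * (i + 0.5 * j)
            y = row_height * j
            total += gem4(math.hypot(x, y))
    return total


def energy_per_particle(density, occupancy):
    """Energy per particle when each lattice site holds `occupancy` particles
    whose centres coincide, at a fixed number density of particles."""
    # Area per site is occupancy / density; a triangular cell has area sqrt(3)/2 * a^2.
    spacing = math.sqrt(2.0 * occupancy / (math.sqrt(3) * density))
    on_site = 0.5 * (occupancy - 1) * gem4(0.0)        # partners in the same clump
    between_sites = 0.5 * occupancy * lattice_sum(spacing)  # particles in other clumps
    return on_site + between_sites


if __name__ == "__main__":
    # Hypothetical "dense" case: clumps of three on a lattice of spacing 1.4 sigma.
    dense = 3.0 / (math.sqrt(3) / 2 * 1.4 ** 2)
    dilute = 0.3
    for label, rho in (("dense", dense), ("dilute", dilute)):
        single = energy_per_particle(rho, 1)
        clumped = energy_per_particle(rho, 3)
        print(f"{label}: single = {single:.3f}, clumps of three = {clumped:.3f}")
```

With these assumed parameters, the clumped arrangement comes out lower in energy at the higher density, whereas at the low density single occupancy is favoured: overlapping costs the full on-site repulsion, and there are few neighbours to avoid.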

Further work* has shown that if such a 
Figure 1 | Two particles, one centre. a–d, Soft particles formed from polymers can change shape and
interpenetrate in such a way that their centres of mass coincide, as shown here for pairs of: polymer
chains (a); polymer rings (b); three-armed polymer ‘stars’ (c); and dendrimers (connected stars; d). The
centres of mass (red dots) of the green and purple particles coincide, even though not all of the monomers
overlap. (Graphic courtesy of Lorenzo Rovigatti.)
‘clumpy’ fluid crystallizes, the resulting solid
retains the clumpy structure to form a regular 
arrangement of heaps. The number of particles 
in each heap varies, giving rise to solids that 
are spatially ordered (as is typical of standard 
crystals), but also locally disordered because of 
the random number of particles in the clumps. 
However, predictions based on coarse-grained 
models are built on approximations that typi- 
cally become progressively less accurate as the 
particle system gets denser, and so are open 
to question. The prediction of clumpy crystals 
from coarse-grained models has therefore 
been seen by some as an academic curiosity, 
especially given that monomer-resolved simu- 
lations of polymer rings* did not confirm the 
predicted formation of such crystals. 

Lenz and colleagues’ monomer-resolved 
simulations for dendrimers now provide 
unmistakable evidence that clumpy crystals 
really can form. The authors’ work builds on 
lessons learned from the earlier computa- 
tional study of polymer rings’, which revealed 
that the particles shrank as density increased. 
The shrinkage progressively invalidated pre- 
dictions made using coarse-grained simula- 
tions. To overcome this problem, the authors 
modelled dendrimers that have a dense core, 
which prevents the particles’ size from varying 
significantly as particle packing increases. The 
researchers’ monomer-resolved simulations 
of the dendrimers closely follow the theoreti- 
cal predictions from corresponding coarse- 
grained models — that is, they confirm that 
clumpy crystals can develop. 

The time is now ripe for an experimental 
search for clumpy crystals composed of 
specially synthesized soft particles. DNA 
dendrimers’ — nanometre-scale particles 
designed to self-assemble from single-stranded 
DNA molecules — may be the optimal candi- 
dates for realizing this unconventional state 
of matter. If so, the resulting clumpy crys- 
tals would enter the fast-growing pantheon 
of DNA constructs with potential uses in 
nanotechnology’. 

Clumpy crystals shed light on fundamental 
physical principles, but they may also have 
practical applications. The variation in the 
number of particles at each lattice site favours 
mass transport, in the form of individual par- 
ticles hopping from one site to another (Fig. 2). 
This could be valuable for applications in 
which lattice rigidity and mass transport need 
to be coupled. Another fascinating possibil- 
ity is that the number of particles occupying 
clumps in a crystal could be controlled by 
compressing the crystal. Indeed, a theoretical 
study has suggested’ that a sequence of crystal 
phases, with differing occupancy numbers, 
occurs as the density of the particles increases. 

The peculiar nature of clumpy crystals 
should be shared by disordered solids — those 
that do not have lattice structures — such as 
glasses. Glassy states® offer unique opportu- 
nities to tailor the viscoelastic properties of 
Figure 2 | The formation of clumpy crystals
in two dimensions. a, At high density, soft
particles (shown as transparent spheres; diameter
corresponds to each particle's typical size) might
adopt an arrangement in which they partially
overlap with several neighbours. The total
repulsion exerted on each particle by its neighbours
is high. b, Alternatively, the same particles might
form a regular lattice of ‘clumps’; in this case, each
clump contains an average of three overlapping
particles. Particles belonging to distinct clumps do
not interact, so that the overall repulsion exerted on
each particle is less than that in a. Particles might
also be able to hop between lattice sites (arrow).
Lenz and colleagues’ numerical simulations
reveal that dendrimeric soft particles form clumpy
crystals. (Graphic courtesy of Lorenzo Rovigatti.)


materials. For example, glasses can be melted 
by applying a ‘shear’ deformation force paral- 
lel to a sample’s surface (an effect known as 
shear melting). Analogous to what has been 
reported for clumpy crystals’, the viscosity of 
shear-melted clumpy glasses should increase 
with the intensity of the applied deformation, a 
phenomenon known as shear thickening. This 
behaviour is at odds with that of most materi- 
als, in which the application of shear decreases 
a sample’s viscosity. Finally, the link between 
soft particles and quantum mechanics should 
be noted: boson particles have been predicted” 
to form supersolids, the quantum analogue of 
clumpy crystals, although such supersolids 
have not yet been observed. Perhaps an exper- 
imentally realized clumpy crystal could give 
insight into some aspects of such mysterious 
quantum solids. 

The unconventional behaviour of soft 
matter has often surprised scientists. Lenz 
and colleagues’ study provides yet another 
example of how soft particles at the nano- and 
microscale do not simply reproduce phe- 
nomena known to occur in the atomic and 
molecular world.
ASTRONOMY


Andromeda’s extended 
disk of dwarfs 


Deep-imaging observations of the Andromeda galaxy and its surroundings have 
revealed a wide but thin planar structure of satellite galaxies that all orbit their 
host in the same rotational direction. SEE LETTER P.62 


R. BRENT TULLY 


In this issue, Ibata and colleagues report
that roughly half the dwarf companion
galaxies of the Andromeda galaxy are
rotating coherently about it in a thin plane.
Their finding provides a fascinating new
constraint on theories of galaxy formation.

First, the observational facts. Andromeda,
also known as Messier 31, is the nearest
giant galaxy to our Galaxy. It is so near that
a census of its companions by deep imaging
with the Canada-France-Hawaii Telescope
has been completed over a large area and to
a faint level of detection. Because the system
is close, distances to the companions can be
measured and the velocities of constituent stars
determined. Such a complete census for the
Milky Way is impossible because candidates
could be anywhere in the sky, even behind a
zone obscured by the Galaxy. And other giant
galaxies are too far away to be studied to such a
level of detail.

Ibata et al. found that 13 of 27 dwarf
companions (satellites), at distances from
Messier 31 of between 35 and 400 kiloparsecs
(about 114,000-1,305,000 light years), lie in a
thin plane 13 kpc thick and share a coherent
velocity pattern: those to the north of Messier 31
are moving away from Earth relative to the galaxy,
and those to the south are moving relatively
towards Earth. No theorist of galaxy formation
would have dared to predict such a situation.
What's more, the Milky Way is in the same
plane as the 13 satellites. The discovery of this
plane is a spectacular result, and the authors
avoid the risk of diluting their message by not
mentioning more speculative matters that add
to the intrigue.

Although the disk of Messier 31 is tilted by
about 50° from the plane of the satellites, the
rotation in the galaxy is in the same direction
of motion as the satellite-velocity pattern.
Looking beyond Ibata and colleagues’ survey 
region, the three galaxies that lie 250-500 kpc 
from Messier 31 — IC 1613, IC 10 and LGS3 
— and which were known before the advent of 
deep-imaging surveys, all reside in the same 
satellite plane. This is particularly interest- 
ing because these three more-distant galax- 
ies contain substantial interstellar gas and 
are still forming stars, and so might be recent 
arrivals on the scene. All the satellites in the 
authors’ survey region are gas deficient (except 
Messier 33, which is not on the plane being 
discussed) and, according to standard galaxy- 
formation models, would be presumed to have 
been in the vicinity of Messier 31 for some time 
and to have complex orbits. 

But things are even stranger than Ibata and 
colleagues suggest. The remaining known 
Messier 31 satellites can be split roughly in 
equal numbers into those at lower and higher 
Galactic longitude. All of those at higher lon- 
gitude than Messier 31, including the Local 
Group’s third-largest galaxy Messier 33, lie 
in a separate common plane. This secondary 
plane is offset and tilted by about 13° from the 
primary plane through Messier 31. 

Ibata et al. remark on earlier suggestions that 
satellites of the Milky Way also seem to lie in a
plane”. It occurred to me that perhaps there 
was enough information in the data archives 
to evaluate the distribution of companions in 
the next-nearest groups of galaxies — those 
around the dominant galaxies Centaurus A 
and Messier 81. The regions around these gal- 
axies have been closely studied in surveys for 
satellite candidates and in follow-up observa- 
tions with the Hubble Space Telescope*”. In the 
case of Centaurus A, 22 of 24 satellites within 
600 kpc of the galaxy’s centre separate into two 
equally populated planes that are roughly par- 
allel but offset by 280 kpc. Centaurus A lies in 


3 JANUARY 2013 | VOL 493 | NATURE | 31 


© 2013 Macmillan Publishers Limited. All rights reserved 


| RESEARCH | NEWS & VIEWS 


one of these planes. Most of the companions 
to Centaurus A are gas poor, but there are 
several systems that contain gas and in which 
star formation is occurring in each plane. 

The situation in the Messier 81 Group is less 
compelling but still suggestive. Here, there is 
a distinction between the distribution of the 
gas-poor satellites and that of gas-rich satellites 
that are undergoing star formation. The gas- 
poor systems lie in a flattened distribution that 
has characteristic dimensions of 60 × 120 kpc,
with the flattening coincident with the ‘Local
Sheet’ structure that harbours all the galaxies
mentioned above and which extends over
a long dimension of 10 Mpc with a thickness
of 1 Mpc. The gas-rich satellites typically
lie farther from Messier 81, and loosely align 
to a plane of their own. 

This discussion of the organized distribu- 
tion of satellites is anchored in the solid evi- 
dence reported by Ibata and colleagues for a 
thin plane with coherent kinematics. There 
are hints that structure in the distribution of 
satellites is the norm. The subject deserves 
further attention, but it should be noted that 
the planes that have been discussed on scales 


DEVELOPMENTAL BIOLOGY 


of 300-500 kpc have a general alignment 
with the Local Sheet. This sheet forms a wall 
of an anti-structure, the ‘Local Void’, that
strongly affects the development of nearby
structure.

Ibata et al. only touch on possible scenarios 
underlying the formation of the planar struc- 
tures. The new information compounds a 
familiar galaxy-formation problem — a defi- 
ciency in the numbers of satellites found com- 
pared with theoretical expectations”*®. Now, 
it seems, not only is there a paucity of satel- 
lites, but also most of those that do exist are in 
these organized structures. The very organiza- 
tion suggests that the structures (possibly as 
distinguished from their constituents) are 
not ancient. 

Current ideas about galaxy formation pro- 
pose that material (both gas and already con- 
structed galaxies) falls into the extended haloes 
around galaxies as flows along filaments. The 
orbital angular momentum of the infalling 
material over time tends to cause motion that 
has the same direction of rotation as that of 
the dominant galaxy in the halo, resulting in 
the build-up of a spiral disk in the galaxy. It 


Segmentation 


within scale 


Irrespective of an organism’s size, the proportional sizes of its parts remain
constant. An experimental model reveals size-dependent adjustment of segment
formation and gene-expression oscillations in vertebrates. SEE LETTER P.101


NAAMA BARKAI & BEN-ZION SHILO 


Developing organisms face a major
challenge: their body pattern must
be adjusted (scaled) to their body
size. But how is tissue size ‘measured’? And 
what conveys general size information to the 
local settings of each cell? Despite intense 
interest, the mechanistic basis of scaling is 
poorly understood. On page 101 of this issue, 
Lauschke et al.' report that scaling persists in 
a tissue-culture model that simulates early 
segmentation in the vertebrate embryo. The 
simple, two-dimensional geometry of this 
system, and the fact that it can be visualized 
in real time and manipulated, opens exciting 
avenues for studying the formation and scaling 
of vertebrate segmentation”. 

The segmented organization of vertebrates 
is set up in the early embryo. As the embryo 
elongates along an anterior—posterior axis, 
segmented structures called somites bud 


*This article and the paper under discussion¹ were
published online on 19 December 2012. 


regularly from the anterior end of its imma- 
ture presomitic mesoderm (PSM) tissue” *. 
The number of segments differs between spe- 
cies, but varies little between individuals of the 
same species. Seminal work by the develop- 
mental biologist Jonathan Cooke showed that 
surgical manipulations that reduce embryo 
size generate smaller yet well-proportioned 
embryos that are patterned normally along 
both anterior—posterior’ and dorso-ventral® 
axes. In particular, somites become propor- 
tionally smaller, but their number and relative 
position are maintained’. 

The observation that somite number and 
size are regulated independently prompted the 
‘clock and wavefront’ model, which postulates 
that spatial and temporal inputs are combined 
to define somite size and position”*. According 
to this model, the position at which a somite 
can be formed at a given time is defined by 
molecular concentration gradients that are 
positioned at a fixed distance from the pos- 
terior pole (the wavefront), and that move 
posteriorly through the PSM towards the 
is reasonable to assume that newly accreted
satellites would share the sense of rotation, 
but that after a few orbits they would tend to 
become scrambled. Because infalling galaxies 
around Messier 31 adhere to such a thin plane,
it would seem that they do not take many
excursions before they are absorbed in the
central galaxy.
pole as the embryo elongates. In parallel with
this, cell-autonomous oscillations in gene 
expression that are coordinated across the 
tissue define the timing of segment formation. 
The overall outcome is sequential segment 
formation in an anterior-to-posterior 
direction. 

The predicted oscillations in gene activity 
have been visualized in chemically fixed and 
in live embryos, and correlate with the pro- 
gressive pattern of somite formation. Further- 
more, genetic manipulations of wavefront 
velocity or oscillation frequency modulate 
segment size’, as predicted by the clock and 
wavefront model. 
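
In its simplest reading, the clock and wavefront model sets the length of a segment equal to the distance the wavefront travels during one cycle of the clock. The sketch below encodes that relation, which is an assumption about the model's usual textbook form rather than a formula quoted from the paper under discussion. The function name and the numerical values are hypothetical and chosen only for illustration.

```python
def segment_length(wavefront_speed, clock_period):
    """Distance covered by the wavefront during one oscillation cycle.

    wavefront_speed: assumed speed of the wavefront (length per unit time)
    clock_period:    assumed period of the gene-expression oscillation
    """
    return wavefront_speed * clock_period


if __name__ == "__main__":
    baseline = segment_length(wavefront_speed=1.0, clock_period=120.0)
    faster_front = segment_length(wavefront_speed=2.0, clock_period=120.0)
    faster_clock = segment_length(wavefront_speed=1.0, clock_period=60.0)
    print("baseline segment length:", baseline)
    print("wavefront twice as fast:", faster_front)
    print("clock twice as fast:", faster_clock)
```

Under this assumption, speeding up the wavefront lengthens the segments, whereas raising the oscillation frequency (shortening the period) makes them smaller, which is the direction of the effects described above.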

In the intact embryo, oscillations are syn- 
chronized between adjacent cells, probably 
through the activity of the Notch signalling 
pathway. The frequency of oscillation in gene 
expression decreases towards the anterior 
PSM, so that anterior cells reach maximal 
signalling activity later than posterior cells. 
Therefore, the pattern of Notch activity seems 
to propagate from the posterior to the anterior 
PSM. This wave of molecular activity is not 
part of the original clock and wavefront model. 
So, what could be the function of such waves? 
Do they contribute to somite differentiation 
or scaling? And what is the molecular basis 
for these dynamics? Answering these and 
related questions is greatly facilitated by the 
ability to visualize’ and perhaps perturb the 
differentiation process as it progresses”®. 

Although methods for live imaging of intact 
embryos have been developed, they are lim- 
ited, in part because of the complex geom- 
etry of the embryo. Lauschke et al. present an 
ex vivo (tissue culture) model that recapitulates 
Figure 1 | Oscillating waves of gene expression in culture. Lauschke et al. obtained presomitic
mesoderm (PSM) tissue from the posterior region of a mouse at day 10.5 of embryonic development. 
They then plated this sample, which carried a Notch-signalling reporter molecule, on a culture dish. 
The reporter reveals waves of gene expression that progress in a central-to-peripheral direction. Some 
20 hours after plating the cells, segments begin to form from the periphery towards the centre. Notably, 
segment size is adjusted (scaled) with the decreasing number of available cells, so that each new segment 
encompasses 20% of the remaining non-segmented region. Gene expression in cells at the centre and 
periphery oscillates at the same phase (darker shade of brown). Thus, the phase gradient between 
adjacent cells becomes steeper as the tissue size decreases. 


the segmentation process in a simple two- 
dimensional geometry. They took a tissue slice 
from the posterior PSM of a mouse embryo 
and maintained it in culture. Cells grew out 
from this cultured tissue as a monolayer and 
began to show the hallmarks of segmentation. 
Most notably, periodic waves of a ‘reporter’ 
molecule for Notch activity seemed to propa- 
gate from the centre of the monolayer towards 
the periphery. After the tissue had grown to 
a certain size, segments began to form from
the periphery, in a temporal sequence that was 
coordinated with the wave-like activity of the 
Notch reporter (Fig. 1). Altogether, the authors 
could detect up to 15 oscillations, at approxi- 
mately 140-minute cycles, and the formation 
of five or more segments. 

Strikingly, segment scaling was maintained 
under these ex vivo conditions. Formation of 
a segment decreased the size of the remain- 
ing unpatterned tissue. Consequently, the 
subsequent segments that formed within this 
smaller region had correspondingly smaller 
sizes — fixed at 20% of the remaining non- 
differentiated cells. Scaling was also observed 
in the velocity of the oscillatory waves of gene 
expression, which decreased in proportion 
to unpatterned-tissue size; this meant that 
the time it took the waves to propagate from 
the centre to the differentiation front in the 
periphery remained constant throughout 
the process. 
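
The two scaling observations described above can be put into a short numerical sketch. It assumes that each new segment takes a fixed 20% of whatever tissue remains unpatterned, and that the wave speed is simply the remaining length divided by a constant transit time. The starting length, the transit time and all names are hypothetical, in arbitrary units, and are not values reported by the authors.

```python
def simulate_scaling(initial_length, fraction=0.2, transit_time=50.0, n_segments=5):
    """Return one record per segment: remaining unpatterned length before the
    segment forms, the segment size, and the wave speed at that stage."""
    records = []
    remaining = initial_length
    for index in range(1, n_segments + 1):
        segment = fraction * remaining          # segment size scales with tissue size
        wave_speed = remaining / transit_time   # speed scales too, so transit time is fixed
        records.append({
            "index": index,
            "remaining": remaining,
            "segment": segment,
            "wave_speed": wave_speed,
        })
        remaining -= segment
    return records


if __name__ == "__main__":
    for row in simulate_scaling(initial_length=1000.0):
        print(
            f"segment {row['index']}: remaining = {row['remaining']:.1f}, "
            f"size = {row['segment']:.1f}, wave speed = {row['wave_speed']:.2f}"
        )
```

In this toy picture, successive segments shrink geometrically (each is 80% of the size of the one before), while the time for a wave to cross the unpatterned tissue stays the same throughout.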

The authors’ further analysis revealed that 
the best predictor of segment size is the phase 
gradient, namely, the rate at which the oscilla- 
tion phase changes across the differentiating 
tissue. But what determines the phase gradi- 
ent, and how does it scale with tissue size? 
The phase gradient is a kinetic property that 
is not directly linked to any physical entity. 
As such, scaling based on the phase gradient 
is different from scaling based on molecu- 
lar gradients studied previously'’*. The 


correlation between phase gradient and seg- 
ment size may imply a causal relationship. 
Alternatively, it could result from a mutual 
dependence on some other factor. 

Perhaps in support of the latter possibility, 
Lauschke et al. observed a temporal increase in 
the steepness of the phase gradient even before 
segments had begun to form. One possibility 
is that both the phase gradient and segment 
size are dictated by gradients of Fgf or Wnt — 
signalling molecules that regulate develop- 
ment. The authors report that gradients of 
these morphogens are indeed established 
across the tissue, but further quantification 
is required to determine whether the gradi- 
ents scale with tissue size, and to analyse their 
potential role in defining the phase gradient 
and segment formation. 

Whether the segment scaling that Lauschke 
and colleagues observe in their ex vivo system 
reflects the scaling mechanism that com- 
pensates for size variations in live embryos 
deserves further investigation. Nevertheless, 
the intriguing ex vivo dynamics impinges on 
fundamental aspects of the clock and wave- 
front model. How are the clock genes of indi- 
vidual cells adjusted to generate a culture that 
shows coordinated oscillations in gene expres- 
sion? Does this coordination involve long- or 
short-range interactions between the cells? 
What determines the initiation of segment for- 
mation? What is the role of Fgf and Wnt in this 
process? The authors’ model system holds the 
promise of providing answers to these ques- 
tions, as well as additional quantitative insight 
into the process of segment formation in the 
vertebrate embryo.
50 Years Ago


‘Education and the humanist 
revolution: By Sir Julian Huxley — 
The knowledge explosion of the past 
hundred years has given us a new
vision of human identity — of the 
world, of man, and of man’s role in the 
world... It leads inevitably to a new 
dominant organization of thought 
and belief, and, after centuries of 
ideological fragmentation, to a new 
comprehensive idea-system, which 
I call ‘evolutionary humanism’

... Our new system must itself be 
evolutionary, not change-resistant 
but change-promoting; it must 
transform as well as transmit. In 
part, it can be achieved by making 
girls and boys understand the moral 
duty of helping and guiding the 
evolutionary process in a desirable 
direction. But something more 
practical is also needed. If our aim 
be greater fulfilment, the next step 
in psychosocial evolution must be 
from the Welfare State towards a 
‘Fulfilment Society’. A humanist
educational system will put the 
idea of a fulfilment society before 
children, and will provide them with 
opportunities for actual personal 
fulfilment. 

From Nature 5 January 1963 


100 Years Ago 


Perfect Health For Women and 
Children. By Elizabeth S. Chesser — 
Miss Chesser has to be commended 
for having treated a wide subject in 
such a sound, common-sense and 
practical manner as will make the 
book appeal to every class of reader, 
both lay and medical. The author 
does not mince matters when she 
finds fault with the unhygienic 
practices of the present day; and 

the work is full of good, telling 
sentences, such as, “if women paid 
as much attention to their teeth as 
they do to their complexions, they 
would be 50 per cent healthier and 
better looking”.

From Nature 2 January 1913 
Membrane enzyme
cuts a fine figure 


Malfunction of presenilin enzymes, which cleave proteins in cell membranes, can 
lead to Alzheimer’s disease. A crystal structure of a microbial presenilin provides 
insights into the workings of this enzyme family. SEE ARTICLE P.56 


MICHAEL S. WOLFE 


The interior of a cell membrane is a
water-repelling environment. So the discovery
that some protease enzymes use
water molecules to cut other proteins within 
membranes was surprising. Three types of 
such enzymes, which have a variety of roles 
in biology and disease, have been identified: 
zinc-containing site-2 proteases, rhomboid 
serine proteases and aspartyl proteases, such 
as presenilin. Atomic-resolution structures of 
site-2 protease and rhomboid enzymes have
greatly improved our understanding of the 
mechanisms by which these proteases cleave 
their substrate proteins, but such a structure 
for a presenilin has remained elusive until now. 
On page 56 of this issue, Li and colleagues* 
describe the first detailed structure of a prese- 
nilin-type protein, providing a framework for 
future mechanistic studies and drug-discovery 
programmes*. 

There are two types of presenilin: those that 
function as single polypeptides, such as the sig- 
nal peptide peptidase’, and those that require 
other proteins for activity. Presenilins of the 
second type assemble into γ-secretases, which
are enzyme complexes, composed of four differ- 
ent proteins, that cleave many single-pass mem- 
brane proteins (each one containing a single 
transmembrane domain, or TMD), including 
the Notch receptor and the amyloid-β precursor
protein (APP)°. The cleavage regulates the func- 
tions of the target proteins and releases peptides 
that can have various activities. Functional 
γ-secretases are essential for Notch signalling
processes’®, which regulate cell differentiation 


*This article and the paper under discussion’ were 
published online on 19 December 2012. 


during development and adulthood in multi- 
cellular animals. Moreover, mutations in genes 
encoding presenilins can cause early-onset Alzheimer’s
disease by altering how the amyloid-β
protein is produced from APP cleavage. 
Electron microscopy has provided low-resolution
structural images of γ-secretase
complexes, but the production of an atomic- 
resolution crystal structure will be highly chal- 
lenging. Li et al. show that this is much more 
feasible (although still not easy) for a presenilin 
alone, representing an important step towards 
determining the structure of the entire enzyme 


complex. Their report provides a case study in 
how to approach the crystallization and structure
determination of a membrane protein, an
area of investigation currently at the frontier of 
structural biology. 

The authors set out to identify a presenilin, 
or a presenilin-type protein, that could be 
overproduced in a bacterial host, purified in 
its active form and concentrated sufficiently 
for crystallization trials. After trying several 
proteins derived from a variety of organisms, 
they focused on a protease (mmPSH) from 
the archaean microorganism Methanoculleus 
marisnigri. This effort involved considerable 
protein engineering, leading to the identifica- 
tion of five mutations that improved the solu- 
bility of mmPSH, supported protease function 
and allowed the formation of high-quality 
crystals for structure determination. 

The structure of mmPSH is only a snapshot, 
one stable conformation among a continuum 
of others. Even so, it confirms that presenilins 
contain nine TMDs and that two aspartate 
amino-acid residues — located in TMD6 and 
TMD7, and known to be essential for protease 
activity’ — are close to each other and buried 
in the membrane (Fig. 1). The structure seems 
to be consistent with previous biochemical 
studies of presenilins and γ-secretases regarding
the arrangement of the TMDs, the water-
accessibility of certain residues purportedly 
near the active site (the part of the enzyme 
where the cleavage reaction takes place), and 
the interaction with substrate proteins®. But it 
provides far more detail than previous studies 
and presents some surprises as well. 

One surprise is that the mmPSH protease 
has a pore that goes through the entire trans- 
membrane region; it may be one route for 
water to enter for the cleavage reaction. It 



Figure 1 | Architecture of a presenilin enzyme. Presenilins are membrane-embedded protease
enzymes that cleave other transmembrane proteins in a regulated manner. Li et al." show that a microbial 
presenilin-type protein, mmPSH, has nine transmembrane domains (TMDs; shown as columns). The 
active site, which drives the cleavage reaction, is composed of two aspartate amino-acid residues (denoted 
as D), one in TMD6 and the other in TMD7. The authors’ results suggest that lateral movement of the 
substrate protein into the active site for cleavage is gated by TMD9. The mmPSH protein contains cavities 
(not shown) that might allow entry of water, which is activated by the catalytic aspartates for substrate 
cleavage into products. N and C represent the amino and carboxy termini of the protein. 
would be interesting to know whether ions or
small molecules can traverse this pore, because
presenilin has been reported to act as a calcium
channel”’. Alternatively, the pore might be 
plugged by a lipid or other small molecules in 
the membrane. 

Another surprise is that the protein asso- 
ciates to form tetramers (composed of four 
mmPSH units), although the functional sig- 
nificance of this finding is unclear. It has been 
suggested that several units of presenilin might 
be present in the γ-secretase complex, but such
an oligomeric arrangement is not required for 
catalysis’. Nevertheless, a tetrameric organiza- 
tion of signal peptide peptidases and preseni- 
lins cannot be ruled out, and the new structure 
will allow the design of specific experiments to 
test its relevance. 

The mmPSH structure also suggests how a 
substrate protein might interact with the pro- 
tease and gain access to the active site. This is 
an especially complicated issue for proteases 
embedded in the membrane: the cleavage site 
is within the substrate’s TMD, and the water- 
containing active site is protected from the 
water-repellent environment of the membrane. 
The substrate, which is restricted to movement 
in two dimensions within the membrane, must 
therefore first interact with the outer surface of 
the protease before gaining access to the inter- 
nal active site. The route of substrate entry in 
mmPSH is apparently between TMD6 and 
TMD9, which is consistent with previous
research’ on the interaction of substrates with
human γ-secretase.

However, the new crystal structure requires 
some conformational adjustment for the 
catalytic aspartates to be properly aligned 
and close enough to interact with each other 
and to activate water for substrate cleavage. 
The authors suggest that interaction with the 
substrate may lead to correct alignment of the 
aspartates. This is a realistic possibility, but 
another is that the protease must be embedded 
in membranes of appropriate lipid composi- 
tion or be solubilized by activity-supporting 
detergents. In any event, the reported structure 
is only one stable conformation of a preseni- 
lin protease, and future crystal structures that 
capture other conformations might provide a 
clearer sense of protease dynamics and sub- 
strate interaction. This is what happened for 
rhomboid enzymes” and site-2 proteases’. 

Li et al. also tried to provide more-specific
insights into human presenilin, particularly 
into how mutations that cause Alzheimer’s 
might affect protease activity. First, they built 
a structural model of human presenilin, based 
on the mmPSH structure. Then they created 
several mmPSH variants containing specific 
changes in amino-acid residues that mirrored 
disease-causing mutations, and explained the 
resultant effects on enzyme activity on the 
basis of the position of these mutations in their 
human presenilin model. 

Despite reasonable similarity between the
two proteins, this is the point at which the
limitations of extrapolating from the micro- 
bial protease to human presenilin in active 
protease complexes are reached. The effects 
of the mutations in mmPSH varied: some 
reduced or abolished activity, others did not. 
The relevance of this result is debatable, given 
that the consistent effect of the mutations in 
human presenilin is to increase the proportion 
of longer amyloid-β peptides with a higher
tendency to form aggregates (a process asso- 
ciated with Alzheimer’s disease), and not a 
reduction of overall protease activity, although 
this can occur with some mutations’*”*. 
Despite these caveats, the mmPSH struc- 
ture provides clear insight into the nature and 
function of membrane-embedded aspartyl 
proteases. The microbial protein is likely to 
have the same basic fold and core structure as 
human presenilins, so Li and colleagues’ work 
opens up a whole new horizon that should 
ultimately lead to a detailed understanding of 
human presenilins and of the entire γ-secretase
complex. Such understanding should lead to 
specific ideas about how disease-causing 
mutations alter function, and how small
molecules might be designed for safe and
effective treatment of Alzheimer’s disease.
All in the timing


How influential are the various factors involved in curbing global warming? 
A study finds that the timing of emissions reduction has the largest impact on 
the probability of limiting temperature increases to 2°C. SEE LETTER P.79 


STEVE HATFIELD-DODDS 


Climate science sometimes seems to have
overtaken economics as the most dis- 
mal science. But a study by Rogelj et al. 
on page 79 of this issue’ might just change that. 
The authors quantify the importance of five 
‘uncertainties’ that are thought to influence 
the chance of limiting global temperatures 
to different levels, using a suite of models to 
generate around 500 scenario variations. They 
find that the timing of international action to 
limit emissions has by far the largest impact. 
Furthermore, the models show that the impact 
of timing is highly nonlinear, and that delay- 
ing emissions limits by only five years, from 
2020 to 2025, would dramatically cut the likeli- 
hood of limiting warming to 2 °C. The findings 
should help to make risks and consequences 
more transparent, and thereby support better- 
informed economic and political decisions. 
The five major uncertainties assessed by 
Rogelj and colleagues were the following: the 
responsiveness of the physical climate system 
to cumulative emissions; the deployment of 
energy- and land-based emission-reduction 
technologies; the global demand for energy 
(which includes combined uncertainties
about population, income growth and energy
efficiency); the global carbon price that the 
international community is willing to impose; 
and the timing of substantive action to limit 
emissions (phased in from 2010). The analysis 
covers limiting the temperature in 2100 to 1.5, 
2, 2.5 and 3°C above pre-industrial levels, with 
a focus on 2°C. 

These scenario comparisons revealed 
timing of global action to be the uncertainty 
with the greatest effect. For example, the 
authors find that bringing forward global 
action on emissions from 2020 to 2015 would 
improve the chance of limiting temperatures 
to 2°C from 56% to 60%, all else being equal. 
To put this another way, achieving the same 
60% chance of success with action starting 
in 2020 would require a 2020 carbon price of 
around US$150 per tonne of carbon dioxide 
equivalent (CO₂e), more than double the
$60 per tonne CO₂e required if action begins in
2015. However, delaying emissions limits from 
2020 to 2025 would bring the chance of suc- 
cess down to 34%, and the authors found no 
scenario in which a feasible increase in carbon 
price or improvements in energy technology 
could make up for these five years of delay. 
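The asymmetry in these figures can be made explicit with a little arithmetic. The sketch below uses only the numbers quoted in the preceding paragraph; the variable and function names are illustrative assumptions and do not come from the study itself.

```python
# Figures as quoted in the text above: chance of limiting warming to 2 °C
# by the year in which global action starts, all else being equal.
chance_by_start_year = {2015: 0.60, 2020: 0.56, 2025: 0.34}

# 2020 carbon price (US$ per tonne CO2e) needed for a 60% chance of success,
# by start year of action, as quoted in the text.
price_for_60pct = {2015: 60, 2020: 150}


def percentage_point_change(p_from, p_to):
    """Return the change between two probabilities in percentage points."""
    return round((p_to - p_from) * 100, 1)


# Acting five years earlier than 2020 versus five years later than 2020.
gain_from_acting_early = percentage_point_change(
    chance_by_start_year[2020], chance_by_start_year[2015]
)
loss_from_delay = percentage_point_change(
    chance_by_start_year[2020], chance_by_start_year[2025]
)

# Ratio of carbon prices needed for the same 60% chance.
price_ratio = price_for_60pct[2020] / price_for_60pct[2015]

print(gain_from_acting_early, loss_from_delay, price_ratio)
```

On these quoted numbers, moving the start date five years earlier gains 4 percentage points, whereas moving it five years later loses 22, which is the nonlinearity the authors emphasize.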

Geophysical uncertainties are the next most 
significant factor, followed by societal factors
influencing energy demand, and then uncer- 
tainties surrounding mitigation technologies. 
This information brings clarity to the relative 
contributions of the timing of global action; 
the role of energy efficiency; potential mitiga- 
tion technologies; and required carbon prices. 
And it will help to assess trade-offs that might 
be made between these factors. Rogelj and col- 
leagues’ analysis also suggests that higher car- 
bon prices will drive more-rapid deployment 
of low-emission technologies and improve the 
chance of success, but that imposing a carbon 
price above $150 would make very little differ- 
ence to temperature outcomes, because this car- 
bon-price path is sufficient to drive the uptake 
of all the modelled reduction technologies.
The study, and another recent analysis 
by some of the same authors’, also find that 
the use of different technologies affects both 
the chance and cost of success. Here, the key 
results are that the development and deploy- 
ment of carbon capture and storage technology 
for fossil fuels and, subsequently, bioenergy are 
essential for limiting temperature increases to 
1.5°C, and that failure to deploy such technol- 
ogy would reduce the chance of limiting tem- 
perature increases to 2°C by 14-16 percentage 
points (with a 2020 carbon price of $60-150). 
Rogelj and colleagues’ assessment comple- 
ments existing published economic analysis of 
the costs and benefits of emissions reductions. 
That literature is characterized by two under- 
appreciated points of consensus. First, the key 
protagonists in the debate over the British gov- 
ernment’s 2006 Stern Review’ on the econom- 
ics of climate change now all agree that action 
to limit temperature rise to 2°C (or emissions 
to 450 parts per million CO₂e) would provide
net benefits*”; this represents a quiet rever- 
sal®”’ in the position of economist William 
Nordhaus, who was initially critical of the 
review. Second, economic analysis shows that 
ambitious global action to limit emissions is 
fully consistent with strong economic growth 
and improvements in living standards*. A 
recent multi-model review’ finds, for exam- 
ple, that average global income is projected to 
roughly double over the four decades to 2050 
across all scenarios targeting 450 parts per million
CO₂e, and that the average income growth
in emerging economies and other developing 
countries would be above the global average. 
Perhaps the only significant limitation 
of Rogelj and colleagues’ analysis is that the 
exploration of geophysical uncertainties 
does not fully incorporate known positive 
feedbacks on climate change, particularly 
the effect of CO₂ and methane releases from
warming permafrost'®"'. (This is a limitation 
of climate modelling generally.) Although 
the timing and magnitude of these releases 
are highly uncertain, on current temperature 
trends the cumulative emissions from perma- 
frost are considered likely to be more than 
30 gigatonnes CO₂e by 2040, with significant
emissions continuing for more than two centuries'’’”.
This has the potential to dwarf reductions
in anthropogenic emissions. However,
this consideration simply strengthens the case 
for early and decisive action to limit anthro- 
pogenic emissions, because it would reduce 
the temperature increases that could trigger 
such feedbacks. This suggests that it would be 
valuable for economic studies to explore the 
‘insurance value’ of reducing the risk of climate 
feedbacks. 

Minor limitations of the study are that it 
uses a single modelling suite for analysis and 
is based on a single, somewhat optimistic 
view of underlying population and economic 
trends'*“*. This implies that the authors’ estimated
probabilities of achieving particular
temperature limits may also be somewhat 
optimistic, and that it would be good to repli- 
cate this work using other models and a wider 
range of underlying scenarios. 

Rogelj et al. have provided a new bench- 
mark for assessing the relative contributions 
of several major uncertainties in the quest to 
limit climate change. The study’s key message 
reinforces previous findings’**° that urgent 
and more ambitious global action is required 
to maintain any chance of limiting global 
warming to 2°C. The clear finding that the 
world would be better off acting from 2015 
rather than 2020 also raises sharp and serious 
questions about the trade-offs implicit in the 
current pace of global negotiations and action. 
The window for effective action on climate 
change is closing quickly, and Rogelj et al. have 
put a price tag on each year of delay.
Break down the walls


Nanoscale imaging reveals that bacterial and fungal enzymes use different 
mechanisms to deconstruct plant cell walls. The finding may provide clues 
about how to enhance the efficiency of liquid-biofuel production from biomass. 


RICHARD A. DIXON 


Plants are increasingly being used as raw
materials in the production of ethanol
and other liquid biofuels. But the poor
accessibility of sugars embedded in plant cell 
walls — known as recalcitrance — is a major 
barrier to economically viable implementation 
of these technologies’. Although recalcitrance 
is an inherent property of plant cell walls, dif- 
ferent microorganisms use different enzymes 
to degrade the walls, and a lack of understand- 
ing of these interactions has limited the design 
of plants that have reduced recalcitrance. 
Writing in Science, Ding et al.” employ state- 
of-the-art microscopy techniques to show, at 
nanometre resolution, plant cell walls being 
degraded through distinct mechanisms by
bacteria and fungi. 

The saying that there is no such thing as a 
free lunch seems particularly apposite when 
considering the evolution of plant cell walls. 
Some plant cells are surrounded by a second- 
ary cell wall that includes a thick layer of cel- 
lulose (a polysaccharide) associated with the 
complex organic polymer lignin. This sec- 
ondary thickening provides physical strength 
to support upright growth, hydrophobicity to 
allow water transport, and protection against 
microbial ingress. In parallel, microbes have 
evolved mechanisms, including cellulose- 
degrading enzymes, to degrade plant cell walls 
to access the plants’ nutritious sugars. Fungi 
use degradative enzymes called cellulases, 
Figure 1 | Degradation of plant cell walls by microbial enzymes. Cellulolytic enzymes degrade
cellulose, the polymer that forms a significant portion of the cell walls of plants. Many bacteria produce 
complexes of cellulolytic enzymes called cellulosomes, whereas fungi attack plant cell walls by secreting 
free cellulolytic enzymes called cellulases. Ding et al.” used nanometre-scale microscopy to show that 
plant degradation by these two strategies occurs through different mechanisms. a, They observe that 
bacterial cellulosomes peel off individual cellulose microfibrils from the cell-wall surface and around 
plasmodesmata (small tubes that connect adjacent plant cells). b, By contrast, their microscope images 
show fungal cellulases penetrating deep into the cellulose microfibrillar network. 


whereas in bacteria, multiple enzymes self- 
assemble into a complex called the cellulo- 
some*. However, the lignin in secondarily 
thickened plant cell walls presents an effective 
barrier to most microbial enzyme systems. 
Many of the feedstock materials used for bio- 
fuel production are lignocellulosic, includ- 
ing perennial grasses, trees and maize (corn) 
stover — the leaves and stalks of the plant left 
after grain harvest. 

In the United States, the 2005 Energy Policy 
Act included a Renewable Fuel Standard (RFS), 
which gives a minimum volume of biofuels to 
be used in the national transportation fuel 
supply each year. However, the US National 
Academy of Sciences recently concluded’ that 
in the absence of “major technological innova- 
tion ... the RFS2-mandated consumption of 
16 billion gallons of ethanol-equivalent cel- 
lulosic biofuels by 2022 is unlikely to be met”. 
Overcoming cell-wall recalcitrance may be the 
technological innovation that has the largest 
impact on reducing production costs of liq- 
uid biofuels from lignocellulosic feedstocks. 
It is clear that reducing lignin content, either 
by taking advantage of natural variation in
plant varieties or through targeted transgenic
approaches, can enhance the release of cell- 
wall sugars”®, although this can have negative 
effects on plant growth’. But might there be 
other ways to target recalcitrance? Ding and 
colleagues sought to pinpoint which features
of the plant cell wall most affect its
microbial digestibility.

The authors used a suite of correlated, multi- 
modal microscopy approaches to observe, in 
real time, fluorescently labelled microbial 
enzymes degrading cell-wall polysaccharides 
of maize stover. This revealed intriguing dif- 
ferences between the digestive strategies of 
bacteria and fungi (Fig. 1). The images show 
that bacterial cellulosomes start digesting the 
cell walls away from the middle lamella — 
the cell-wall layer that joins adjacent plant 
cells. By contrast, fungal cellulases dissolve 
the wall in a uniform manner from the inner- 
most side, leaving the middle lamella intact. 
Furthermore, it seems that fungal cellulases 
penetrate the cellulose microfibrillar network 
to produce digestion pits, whereas cellulosomes 
peel off individual microfibrils from the 
cell-wall surface. 
The authors’ observations also confirm
that lignin is indeed the major impediment 
to cell-wall degradation by either fungal 
cellulases or bacterial cellulosomes. 
But they take the story an important 
step forward by showing that the main 
problem is that lignin physically masks the 
cellulose microfibrils, rather than absorb- 
ing lignin-degrading enzymes. By quantifying 
images obtained by atomic force microscopy, 
the authors show that the hydrophobic planar 
faces of cellulose molecules are the preferred 
binding sites of microbial enzymes, and are 
therefore crucial for enzyme access. The hydro- 
phobic faces are exposed in primary cell walls, 
but are masked by lignin in secondary walls, 
either partially in certain parenchyma cells or 
totally in sclerenchyma-fibre cells.

What do these findings mean for the devel- 
opment of biomass feedstocks? Lignification 
of some cell types occurs after the plant has 
finished growing, but early harvesting of per- 
ennial crops presents sustainability problems 
because essential nutrients such as nitrogen 
and phosphorus are transferred back to the 
root system only during this senescent phase 
of the plant life cycle*. Ding and colleagues’ 
study suggests that it might be just as effec- 
tive, if not more so, to engineer plants to have 
reduced levels of lignin–polysaccharide linkages
as it would be to attempt to reduce lignin
levels per se. This would produce a cellulose- 
microfibril structure that is more conducive to 
degradation and would substantially reduce 
the severity of pretreatment required to 
render lignocellulosic biomass suitable for 
biofuel generation. 

It will be interesting to apply the micros- 
copy techniques used by Ding and colleagues 
to probe the interactions between microbial 
enzyme systems and plants that have been 
genetically modified to have reduced recalci- 
trance through targeting of cell-wall compo- 
nents. Only by understanding recalcitrance 
from the perspectives of both the plant and the 
microbe can more effective bioprocessing sys- 
tems be developed. However, whether modify- 
ing plant cell walls for optimal deconstruction 
will allow microorganisms to jump the lunch 
queue and cause disease in their plant targets 
remains to be seen.
Non-Fermi-liquid d-wave metal phase of
strongly interacting electrons 


Hong-Chen Jiang, Matthew S. Block, Ryan V. Mishmash, James R. Garrison, D. N. Sheng, Olexei I. Motrunich
& Matthew P. A. Fisher


Developing a theoretical framework for conducting electronic fluids qualitatively distinct from those described by 
Landau’s Fermi-liquid theory is of central importance to many outstanding problems in condensed matter physics. 
One such problem is that, above the transition temperature and near optimal doping, high-transition-temperature 
copper-oxide superconductors exhibit ‘strange metal’ behaviour that is inconsistent with being a traditional Landau 
Fermi liquid. Indeed, a microscopic theory of a strange-metal quantum phase could shed new light on the interesting 
low-temperature behaviour in the pseudogap regime and on the d-wave superconductor itself. Here we present a theory 
for a specific example of a strange metal—the ‘d-wave metal’. Using variational wavefunctions, gauge theoretic 
arguments, and ultimately large-scale density matrix renormalization group calculations, we show that this 
remarkable quantum phase is the ground state of a reasonable microscopic Hamiltonian—the usual t-J model with 
electron kinetic energy t and two-spin exchange J supplemented with a frustrated electron ‘ring-exchange’ term, 
which we here examine extensively on the square lattice two-leg ladder. These findings constitute an explicit
theoretical example of a genuine non-Fermi-liquid metal existing as the ground state of a realistic model.


Over the past several decades, experiments on strongly correlated 
materials have routinely revealed, in certain parts of the phase dia- 
gram, conducting liquids with physical properties that are qualita- 
tively inconsistent with Landau’s Fermi-liquid theory’. Examples of 
these ‘non-Fermi-liquid’ metals” include the strange-metal phase of 
the copper-oxide superconductors** and the heavy fermion materials 
near a quantum critical point®®. However, such non-Fermi-liquid 
behaviour has been challenging to characterize theoretically, largely 
owing to the lack of a weakly interacting quasiparticle description. It is 
even difficult to define a non-Fermi liquid unambiguously, although 
possible deviations from Fermi-liquid theory include, for example, 
violation of Luttinger’s volume theorem’, vanishing quasiparticle 
weight, and anomalous thermodynamics and transport**"'’. This 
theoretical difficulty is probably preventing a full understanding of 
the mechanism behind high-temperature superconductivity and also 
hampering theoretically guided searches for new exotic materials. 
Pioneering early theoretical work on the copper oxides relied on 
two main premises*'*""’, which guide but do not constrain our pursuit 
of non-Fermi-liquid physics: (1) that the microscopic behaviour can 
be described by the square lattice Hubbard model with on-site 
Coulomb repulsion, which at strong coupling reduces in its simplest 
form to the t-J model; and (2) that the physics of the system can be 
faithfully represented by the ‘slave-boson’ technique, in which the 
physical electron operator is written as the product of a slave boson 
(‘chargon’), which carries the electronic charge, and a spin-half fer- 
mionic ‘spinon’’’, which carries the spin (both chargon and spinon 
are strongly coupled to an emergent gauge field). However, within the 
slave-boson formulation, it has been difficult to access non-Fermi- 
liquid physics at low temperatures because this requires the chargons 
to be in an uncondensed, yet conducting, quantum phase'’—the elu- 
sive ‘Bose metal’. Early attempts to describe the strange metal in this 
framework treated it as a strictly finite-temperature phenomenon in
which the slave bosons form an uncondensed, but classical, Bose
fluid’>"*, a treatment which excludes the possibility that the strange 
metal is a true quantum phase at all. 

In our view, the strange metal should be viewed as a genuine two- 
dimensional (2D) quantum phase, which may be unstable to super- 
conducting or pseudogap behaviour. Indeed, recent experimental 
work on La₂₋ₓSrₓCuO₄ has shown that when superconductivity is
stripped away by high magnetic fields, strange-metal behaviour per- 
sists over a wide doping range down to extremely low temperatures”. 
Thus, the strange metal in the copper oxides is quite possibly a true, 
extended, zero-temperature quantum phase’. 

Inspired by these results and building on our previous work, which 
proposed” and realized’*™ a true, zero-temperature Bose metal, we 
use a variant of the slave-boson approach to construct and analyse an 
exotic 2D non-Fermi-liquid quantum phase, which we refer to as the 
‘d-wave metal’. The d-wave metal is modelled by a variational wavefunction
consisting of a product of a d-wave Bose-metal wavefunction’
for the chargons and a usual Slater determinant for the
spinons. Importantly, placing the chargons in the d-wave Bose metal 
state provides the many-electron wavefunction with a sign structure 
that is qualitatively distinct from that of a simple Slater determinant, 
and in particular, imprints strong singlet d-wave two-particle cor- 
relations. This results in a gapless, conducting quantum fluid with 
an electron momentum distribution function that exhibits a critical, 
singular surface that violates Luttinger’s volume theorem’, as well 
as prominent critical Cooper pairs with d-wave character. The d- 
wave nature of our phase is tantalizingly suggestive of incipient 
d-wave superconductivity and is thus of possible relevance to the 
copper oxides. 

Furthermore, tying back into premise (1) above, we propose a rea- 
sonably simple model Hamiltonian to stabilize the d-wave metal by 
augmenting the traditional t-J model with a four-site ring-exchange 
term K. Then, thanks to the numerical and analytical tractability
provided by the density matrix renormalization group (DMRG)**”° 
and bosonization’”*°, we can place the problem on a quasi-one- 
dimensional (1D) two-leg ladder geometry (see Fig. 1). In this system, 
we establish several lines of compelling evidence that the d-wave 
metal phase exists as the quantum ground state of our t-J-K model 
Hamiltonian, and we are able to characterize and understand the phase 
very thoroughly. Importantly, our realized two-leg d-wave metal state 
is non-perturbative, in that it cannot be understood within conven- 
tional Luttinger liquid theory” starting from free electrons*'. We 
believe this study to be one of the first unbiased numerical demonstra- 
tions of a non-Fermi-liquid metal as the stable ground state of a local 
Hamiltonian. We also discuss straightforward extensions of these 
results to two dimensions, and comment on their potential relevance 
to the actual non-Fermi liquids observed in experiments. 


Gauge theory and variational wavefunctions 


Our theoretical description of the non-Fermi-liquid d-wave metal 
begins by writing the electron operator for site $\mathbf{r}$ and spin state 
$s = \uparrow, \downarrow$ as the product of a bosonic chargon $b(\mathbf{r})$ and a fermionic 
spinon $f_s(\mathbf{r})$; that is, $c_s(\mathbf{r}) = b(\mathbf{r}) f_s(\mathbf{r})$. With $b(\mathbf{r})$ a hard-core 
boson operator, this construction prohibits doubly occupied sites, an 
assumption we make from here on. The physical electron Hilbert space 
is recovered by implementing at each site the constraint 
$b^\dagger(\mathbf{r}) b(\mathbf{r}) = \sum_s f_s^\dagger(\mathbf{r}) f_s(\mathbf{r}) = \sum_s c_s^\dagger(\mathbf{r}) c_s(\mathbf{r}) = n_e(\mathbf{r})$, 
which physically means that a given site is either empty or contains a 
chargon and exactly one spinon to compose an electron. Theoretically, 
this is achieved by strongly coupling the b and f fields via an emergent 
gauge field. 
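
The effect of the on-site constraint can be illustrated by brute-force enumeration. The sketch below is only an illustration: the occupation-number tuples and the function name are assumptions introduced here, not notation from the text. It lists every local slave-particle state with at most one hard-core chargon and at most one spinon of each spin, and keeps those satisfying $n_b = n_{f\uparrow} + n_{f\downarrow}$.

```python
from itertools import product


def physical_states():
    """Local states (n_b, n_f_up, n_f_down) obeying n_b = n_f_up + n_f_down."""
    allowed = []
    for n_b, n_up, n_dn in product((0, 1), repeat=3):
        # Hard-core chargon: n_b is 0 or 1, so double spinon occupancy is excluded.
        if n_b == n_up + n_dn:
            allowed.append((n_b, n_up, n_dn))
    return allowed


labels = {
    (0, 0, 0): "empty site",
    (1, 1, 0): "spin-up electron",
    (1, 0, 1): "spin-down electron",
}

states = physical_states()
for state in states:
    print(state, labels[state])
```

Of the eight unconstrained local states, only three survive, which is the local Hilbert space of the t-J model with no doubly occupied sites.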

Under the natural assumption that the spinons are in a Fermi-sea 
state, the behaviour of the chargons determines the resulting elec- 
tronic phase. Condensing the bosonic chargons so that $\langle b(\mathbf{r}) \rangle \neq 0$ 
implies $c_s(\mathbf{r}) \propto f_s(\mathbf{r})$; so, in this case, the electronic phase is that of 
a Fermi liquid. It then follows that to describe a non-Fermi-liquid 
conducting quantum fluid within this framework, the chargons 
must not condense, $\langle b(\mathbf{r}) \rangle = 0$, but must still conduct. Accessing 
such a ‘Bose metal’ phase has proved extremely difficult. In recent 
work, however, we have realized a concrete, genuine Bose-metal 
phase, which we named the ‘d-wave Bose liquid’ or, equivalently, the 
‘d-wave Bose metal’ (DBM). The DBM is central to our construction 
of the d-wave metal. Specifically, in the DBM, we decompose the hard- 
core boson as $b(\mathbf{r}) = d_1(\mathbf{r}) d_2(\mathbf{r})$ with the constraint 
$d_1^\dagger(\mathbf{r}) d_1(\mathbf{r}) = d_2^\dagger(\mathbf{r}) d_2(\mathbf{r}) = b^\dagger(\mathbf{r}) b(\mathbf{r})$, where $d_1$ and $d_2$ are 
fermionic slave particles (‘partons’) with anisotropic hopping patterns: 
$d_1$ ($d_2$) is chosen to hop preferentially in the $\hat{x}$ ($\hat{y}$) direction. The 
resulting bosonic phase is a conducting, yet uncondensed, quantum 
fluid, which is precisely the phase into which we place the charge 
sector of the d-wave metal. That is, for the d-wave metal we take an 
all-fermionic decomposition of the electron 


$$ c_s(\mathbf{r}) = d_1(\mathbf{r})\, d_2(\mathbf{r})\, f_s(\mathbf{r}) \tag{1} $$


subject to the constraint 


$$ d_1^\dagger(\mathbf{r}) d_1(\mathbf{r}) = d_2^\dagger(\mathbf{r}) d_2(\mathbf{r}) = \sum_s f_s^\dagger(\mathbf{r}) f_s(\mathbf{r}) = n_e(\mathbf{r}) \tag{2} $$


The resulting theory now includes two gauge fields: one to glue 
together $d_1$ and $d_2$ to form the chargon and another to glue together 
b and f to form the electron. In the Supplementary Information, we give 
a detailed bosonization analysis of this gauge theory for the two-leg 
ladder study (see below). 


Figure 1 | Schematic of the t-J-K model Hamiltonian. a, Picture of the full 
t-J-K model, equation (6), on the two-leg ladder. We use periodic boundary 
conditions in the long (x) direction for all calculations. b, Action of the ring 
term $H_K$, equation (8), on a single plaquette, elucidating its ‘singlet-rotation’ 
nature. 

Guided by the slave-boson construction, one can naturally construct 
electronic variational wavefunctions by taking the product of a hard- 
core bosonic wavefunction $\psi_b$ and a fermionic wavefunction $\psi_f$ and 
evaluating them at the same coordinates (Gutzwiller projection): 


$$ \Psi(\{\mathbf{r}_i^\uparrow\}, \{\mathbf{r}_i^\downarrow\}) = P_G\left[ \psi_b(\{\mathbf{R}_i\})\, \psi_f(\{\mathbf{r}_i^\uparrow\}, \{\mathbf{r}_i^\downarrow\}) \right] \tag{3} $$


where $P_G$ performs the projection into the physical electronic Hilbert 
space: $\{\mathbf{R}_i\} = \{\mathbf{r}_i^\uparrow\} \cup \{\mathbf{r}_i^\downarrow\}$. If we put the f partons into a spin-singlet 
Fermi-sea state with orbitals $\{\mathbf{k}_j\}$ (Slater determinant), that is, 
$\psi_f^{\mathrm{FS}}(\{\mathbf{r}_i^\uparrow\}, \{\mathbf{r}_i^\downarrow\}) = \det[e^{i \mathbf{k}_j \cdot \mathbf{r}_i^\uparrow}]\, \det[e^{i \mathbf{k}_j \cdot \mathbf{r}_i^\downarrow}]$, then we can 
model both the Fermi-liquid metal and the non-Fermi-liquid d- 
wave metal in a unified way. In both cases, the wavefunctions are 
straightforward to implement using variational Monte Carlo (VMC) 
methods. 

For the Fermi liquid, we put the b partons into a superfluid wave- 
function $\psi_b^{\mathrm{SF}}$ via a typical Jastrow form, so that, schematically, 
$\Psi^{\mathrm{FL}} = P_G[\psi_b^{\mathrm{SF}} \times \psi_f^{\mathrm{FS}}]$. Given that $\psi_b^{\mathrm{SF}}$ is a positive wavefunction, 
the sign structure of $\Psi^{\mathrm{FL}}$ is identical to that of the non-interacting 
Fermi-sea state. In contrast, to model the d-wave metal, we put the b 
partons into a Bose-metal wavefunction according to the DBM 
construction of refs 21-24: 


$$ \psi_b^{\mathrm{DBM}}(\{\mathbf{R}_i\}) = \psi_{d_1}(\{\mathbf{R}_i\}) \times \psi_{d_2}(\{\mathbf{R}_i\}) \tag{4} $$


where $\psi_{d_1}$ ($\psi_{d_2}$) is a Slater determinant with a Fermi sea compressed 
in the $\hat{x}$ ($\hat{y}$) direction. Then, we have 


$$ \Psi^{d\text{-wave}} = P_G\left[ \psi_b^{\mathrm{DBM}} \times \psi_f^{\mathrm{FS}} \right] = P_G\left[ \psi_{d_1} \times \psi_{d_2} \times \psi_f^{\mathrm{FS}} \right] \tag{5} $$


Interestingly, this construction, equation (5), is actually a time- 
reversal invariant analogue of the composite Fermi-liquid description 
of the half-filled Landau level, where the d-wave Bose-metal wave- 
function has the role of Laughlin’s $\nu = 1/2$ bosonic state. Just as 
Laughlin’s wavefunction imprints a nontrivial complex phase pattern 
on the Slater determinant, the DBM wavefunction imprints a non- 
trivial d-wave sign structure. There are many physical signatures 
associated with putting the chargons into the DBM phase, making 
the d-wave metal dramatically distinguishable from the traditional 
Landau Fermi liquid (see below). 
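
As a rough illustration of how a product-of-determinants amplitude of this kind might be evaluated for a single configuration, the sketch below multiplies four plane-wave Slater determinants at shared coordinates. Everything specific in it is an assumption made for illustration: the 6 x 2 toy ladder, the orbital lists, the helper names and the plain Gaussian-elimination determinant. It is not the authors' VMC code, and it omits normalization and Monte Carlo sampling.

```python
import cmath
import math


def det(matrix):
    """Determinant of a square complex matrix (Gaussian elimination, partial pivoting)."""
    a = [list(row) for row in matrix]
    n = len(a)
    result = 1.0 + 0.0j
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-14:
            return 0.0 + 0.0j          # singular, e.g. two particles on one site
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result           # a row swap flips the sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return result


def slater(orbitals, positions):
    """det[exp(i k_j . r_i)]: rows are particle positions, columns are orbitals."""
    return det([[cmath.exp(1j * (kx * x + ky * y)) for (kx, ky) in orbitals]
                for (x, y) in positions])


def dwave_metal_amplitude(r_up, r_dn, orb_d1, orb_d2, orb_f):
    """Unnormalized amplitude: product of four determinants at shared coordinates."""
    R = list(r_up) + list(r_dn)        # d1 and d2 sit on every electron position
    return (slater(orb_d1, R) * slater(orb_d2, R)
            * slater(orb_f, r_up) * slater(orb_f, r_dn))


# Toy 6 x 2 ladder with 4 electrons (density 1/3); orbital choices are illustrative.
L_X = 6
q = 2 * math.pi / L_X
ORB_D1 = [(0.0, 0.0), (q, 0.0), (-q, 0.0), (0.0, math.pi)]   # periodic; both bands
ORB_D2 = [(m * q / 2, 0.0) for m in (-3, -1, 1, 3)]          # antiperiodic; bonding
ORB_F = [(m * q / 2, 0.0) for m in (-1, 1)]                  # antiperiodic; bonding

r_up = [(0, 0), (3, 1)]
r_dn = [(1, 0), (5, 1)]
amp = dwave_metal_amplitude(r_up, r_dn, ORB_D1, ORB_D2, ORB_F)
print(abs(amp))
```

Because the $d_1$ and $d_2$ determinants are evaluated at the union of all electron positions, the amplitude vanishes automatically whenever two electrons share a site, so the Gutzwiller projection comes for free in this representation. Exchanging two same-spin electrons flips the sign of three determinants, which leaves the overall fermionic antisymmetry intact.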


Microscopic ring-exchange model 


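The t-J-K model Hamiltonian which we propose to stabilize the 
d-wave metal phase is given by 


$$ H = H_{tJ} + H_K \tag{6} $$


$$ H_{tJ} = -t \sum_{\langle i,j \rangle,\, s=\uparrow,\downarrow} \left( c_{is}^\dagger c_{js} + \mathrm{H.c.} \right) + J \sum_{\langle i,j \rangle} \mathbf{S}_i \cdot \mathbf{S}_j \tag{7} $$


$$ H_K = 2K \sum_{\square} \left( \mathcal{S}_{13}^\dagger \mathcal{S}_{24} + \mathcal{S}_{24}^\dagger \mathcal{S}_{13} \right) \tag{8} $$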


where $\langle i,j \rangle$ and $\square$ indicate sums over all nearest-neighbour bonds 
and all elementary plaquettes of the 2D square lattice, respectively. In 
the spirit of the t-J model, we choose to work in the subspace of no 
doubly occupied sites, but for simplicity, we do ignore the term 
$-\frac{1}{4} n_i n_j$ usually present in typical definitions of the t-J model. In 
equation (8), we have defined a singlet creation operator on two sites as 
$\mathcal{S}_{ij}^\dagger = \frac{1}{\sqrt{2}} \left( c_{i\uparrow}^\dagger c_{j\downarrow}^\dagger - c_{i\downarrow}^\dagger c_{j\uparrow}^\dagger \right)$, so that $H_K$ can be viewed as a four-site 
singlet-rotation term (see Fig. 1). For K > 0, the ground state of $H_K$ on a 
single plaquette with two electrons is a $d_{xy}$-orbital spin-singlet; so, 
loosely speaking, $H_K$ has a tendency to build d-wave correlations into 
the system and qualitatively alter the sign structure of the electronic 
ground state. Further arguments for studying this model in our inves- 
tigation of the d-wave metal can be found in the Supplementary 
Information. 

Although not particularly conventional, our ring-exchange term 
$H_K$ (which should not be confused with four-site cyclic spin- 
exchange) is present when projecting the continuum many-body 
Hamiltonian for screened Coulomb-interacting electrons into a nar- 
row, tight-binding band (see Supplementary Information). In fact, 
estimating the strength of K, or coefficients on related terms, in real 
materials such as $\mathrm{La}_{2-x}\mathrm{Sr}_x\mathrm{CuO}_4$ is an interesting unresolved question. 


DMRG and VMC study of two-leg model 


Unfortunately, as with any interacting fermionic model, our t-J-K 
Hamiltonian suffers from the ‘fermionic sign problem’, rendering 
quantum Monte Carlo calculations inapplicable. We thus follow 
the heretofore successful approach of accessing 2D gapless 
phases by studying their quasi-1D descendants on ladder geometries, 
relying heavily on large-scale DMRG calculations. In fact, we have 
already established that for two, three and four legs, the DBM 
phase itself is the stable ground state of a boson ring-exchange model 
analogous to equation (6). Here, we take the important first step of 
placing the electron ring t-J-K model on the two-leg ladder in search 
of a two-leg descendant of the d-wave metal. 

For concreteness, we now consider the model, equation (6), on the 
two-leg ladder (see Fig. 1) at a generic electron density of 
$\rho = N_e/(2L_x) = 1/3$, where $N_e = N_{e\uparrow} + N_{e\downarrow}$ is the total number of 
electrons and $L_x$ is the length of our two-leg ladder (that is, the system 
has $L_x \times 2$ total sites). At this density $\rho = 1/3 < 1/2$ on the two-leg 
ladder, the non-interacting ground state is a spin-singlet wherein 
electrons of each spin partially fill the bonding band ($k_y = 0$), leaving 
the antibonding band ($k_y = \pi$) empty. Thus, for $t \gg K$, we expect the 
system to be in a simple one-band metallic state, which is a two-leg 
analogue of the Fermi liquid. Formally speaking, this phase is a 
conventional Luttinger liquid with two 1D gapless modes (central 
charge c = 2). For moderate values of ring exchange, $K \gtrsim t$, we 
anticipate the unconventional non-Fermi-liquid d-wave metal to be a 
candidate ground state. On the two-leg ladder at this density, the 
d-wave metal phase has characteristic band-filling configurations for 
the $d_1$, $d_2$ and $f_{\uparrow/\downarrow}$ partons as shown in Fig. 2: $d_1$ partially fills both 
bonding and antibonding bands, whereas $d_2$ and $f_{\uparrow/\downarrow}$ fill only the 
bonding band. (The $d_1$ and $d_2$ configurations constitute the phase 
denoted ‘DBL[2,1]’ in ref. 22.) In a mean-field approximation in which 
the partons do not interact, the system has five 1D gapless modes 
corresponding to the five total partially filled bands. However, in the 
strong-coupling limit of the full quasi-1D gauge theory (see the 
Supplementary Information for details), two orthonormal linear 
combinations of the original five modes are rendered massive, leaving 
an unconventional Luttinger liquid with c = 3 gapless modes. 


Figure 2 | Picture of the parton bands for the d-wave metal phase. We show 
orbitals for a 48 × 2 system, showing partially occupied bonding ($k_y = 0$) and 
antibonding ($k_y = \pi$) bands for $d_1$ and partially occupied bonding bands for $d_2$ 
and $f_{\uparrow/\downarrow}$; that is, each Slater determinant in equation (5) consists of momentum- 
space orbitals as depicted here. The total electron number is 
$N_e = N_{e\uparrow} + N_{e\downarrow} = N_{d_1} = N_{d_2} = N_{f\uparrow} + N_{f\downarrow} = 32$, with $N_{f\uparrow} = N_{f\downarrow} = 16$ so that 
$S_{\mathrm{tot}} = 0$; the longitudinal boundary conditions are periodic for $d_1$ and 
antiperiodic for $d_2$ and $f_{\uparrow/\downarrow}$. This is precisely the same d-wave metal 
configuration for which we display characteristic measurements in Fig. 5. 


©2013 Macmillan Publishers Limited. All rights reserved 

We now provide extensive numerical evidence that this two-leg 
descendant of the d-wave metal exists as the ground state of the 
t-J-K model over a wide region of the phase diagram. We summarize 
these results in Fig. 3 by presenting the full phase diagram in the 
parameters K/t versus J/t as obtained by DMRG calculations on length 
$L_x = 24$ and 48 systems at electron density $\rho = 1/3$. For small K, we 
find a conventional one-band (spinful) Luttinger liquid phase which 
is a two-leg analogue of the Fermi-liquid metal. For moderate J and 
upon increasing K, the system goes into the unconventional non- 
Fermi-liquid d-wave metal phase, which is the main focus of this 
work. The phase boundaries in Fig. 3, all of which represent strong 
first-order transitions, were determined by measuring several stand- 
ard momentum-space correlation functions in the DMRG (see the 
Supplementary Information for details): the electron momentum 
distribution function $\langle c_{\mathbf{q}s}^\dagger c_{\mathbf{q}s} \rangle$, the density-density structure factor 
$\langle \delta n_{\mathbf{q}}\, \delta n_{-\mathbf{q}} \rangle$, and the spin-spin structure factor $\langle \mathbf{S}_{\mathbf{q}} \cdot \mathbf{S}_{-\mathbf{q}} \rangle$. 

For concreteness, we now focus on the cut along J/t = 2 in Fig. 3 for 
a 48 × 2 system with $N_e = 32$ electrons. We take one point deep 
within the conventional one-band metal at K/t = 0.5 and the other 
point deep within the exotic d-wave metal at K/t = 1.8. First focusing 
on the former case, in Fig. 4 we show DMRG measurements char- 
acteristic of the conventional Luttinger liquid. The ground state is a 
spin-singlet with a sharp singularity in the electron momentum dis- 
tribution function at $q_y = 0$ and $q_x = k_F = \pi N_{e\uparrow}/L_x = 8 \times 2\pi/48$, 
which is a usual Fermi wavevector determined solely from the elec- 
tron density. The density-density and spin-spin structure factors 
at $q_y = 0$ also exhibit familiar features at $q_x = 0$ and 
$q_x = 2k_F = 16 \times 2\pi/48$, both characteristic of an ordinary one-band 
metallic state with gapless charge and spin modes. We stress that, even 
with the constraint of no double-occupancy and non-zero K/t = 0.5 and 
J/t = 2, the interacting electronic system is still qualitatively very simi- 
lar to the two-leg free Fermi gas; analogously, the 2D Fermi liquid is 
in many ways qualitatively similar to the 2D free Fermi gas. In both 
cases, the main differences are basically quantitative and are well 
understood. 


Figure 3 | Phase diagram of the t-J-K electron ring-exchange model at 
electron density $\rho = 1/3$ on the two-leg ladder. In addition to the 
conventional one-band metal (‘Fermi-liquid metal’) and exotic ‘non-Fermi- 
liquid d-wave metal’, there are two other realized phases. For small J, there is an 
intermediate phase with fully polarized electrons. For large K, owing to the 
inherently attractive nature of ring-exchange interactions, the system 
generally phase separates along the ladder. 


Figure 4 | DMRG measurements in the conventional Luttinger liquid phase 
at J/t = 2 and K/t = 0.5. We show the electron momentum distribution 
function (a), the density-density structure factor (b) and the spin-spin structure 
factor (c). The important wavevectors $k_F$ and $2k_F$, as described in the text, are 
highlighted by vertical dashed-dotted lines. 
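
The wavevectors quoted for the conventional one-band metal follow directly from the filling. The short sketch below reproduces that arithmetic in units of $2\pi/L_x$; the function name and the choice of exact fractions are illustrative assumptions, and the only physics used is $k_F = \pi N_{e\uparrow}/L_x$ for a spin-singlet state that fills the bonding band only.

```python
from fractions import Fraction


def one_band_wavevectors(n_electrons):
    """Return (k_F, 2k_F) in units of 2*pi/L_x for a singlet filling one band."""
    n_up = Fraction(n_electrons, 2)
    k_f = n_up / 2                 # pi*N_up/L_x = (N_up/2) * (2*pi/L_x)
    return k_f, 2 * k_f


L_x, N_e = 48, 32
density = Fraction(N_e, 2 * L_x)   # electrons per site on the L_x x 2 ladder
k_F, two_k_F = one_band_wavevectors(N_e)
print(density, k_F, two_k_F)
```

For the 48 × 2 system with 32 electrons this gives density 1/3, $k_F = 8 \times 2\pi/48$ and $2k_F = 16 \times 2\pi/48$, matching the values stated in the text.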

We turn now to the characteristic point within the d-wave metal 
phase at J/t = 2 and K/t = 1.8. In Fig. 5, we show a set of DMRG 
measurements at this point, as well as measurements corresponding 
to a variational wavefunction chosen such that its singular features 
best reproduce the DMRG data (see the Supplementary Informa- 
tion for details of our VMC methods). The selected d-wave metal 
wavefunction is depicted schematically in Fig. 2. Specifically, we have 
the following parton Fermi wavevectors: $2k_{Fd_1}^{(0)} = 21 \times 2\pi/48$, 
$2k_{Fd_1}^{(\pi)} = 11 \times 2\pi/48$, $2k_{Fd_2} = 32 \times 2\pi/48$, and $2k_{Ff} = 16 \times 2\pi/48$. 
The overall agreement between the DMRG and VMC measurements 
is very compelling, and we now summarize our understanding of 
these results from the perspective of d-wave metal theory. 


Figure 5 | DMRG measurements in the unconventional d-wave metal phase 
at J/t = 2 and K/t = 1.8. We show the same quantities as in Fig. 4. Here, we also 
show the matching VMC measurements using a d-wave metal trial 
wavefunction, depicted in Fig. 2. 

In sharp contrast to the conventional Luttinger liquid, the elec- 
tron momentum distribution function now has singularities for both 
$q_y = 0$ and $q_y = \pi$ at a wavevector $q_x = K_e \equiv [k_{Fd_1}^{(0)} - k_{Fd_1}^{(\pi)}]/2$. This 
wavevector corresponds to a composite electron made from a com- 
bination of parton fields consisting of a right-moving $d_1$ parton, a left- 
moving $d_2$ parton, and a right-moving spinon: $d_{1R}^{(q_y)} d_{2L} f_{Rs}$. In fact, 
these ‘enhanced electrons’ can be guessed from simple ‘Amperian 
rules’ in our quasi-1D gauge theory, as described in detail in 
the Supplementary Information. 
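
The location of the singular wavevector can be checked against the parton Fermi wavevectors quoted for the trial state ($2k_F$ values of 21, 11, 32 and 16 in units of $2\pi/48$). The sketch below assumes, as a simplification introduced here, that a right-mover carries $+k_F$, a left-mover carries $-k_F$, and that the momentum of the composite is the plain sum of its parts. The dictionary keys and function name are hypothetical.

```python
from fractions import Fraction

# Parton 2k_F values quoted in the text, in units of 2*pi/48.
TWO_KF = {"d1_bonding": 21, "d1_antibonding": 11, "d2": 32, "f": 16}


def composite_momentum(d1_band):
    """Momentum of (right-moving d1) + (left-moving d2) + (right-moving spinon)."""
    k_d1 = Fraction(TWO_KF[d1_band], 2)
    k_d2 = Fraction(TWO_KF["d2"], 2)
    k_f = Fraction(TWO_KF["f"], 2)
    return k_d1 - k_d2 + k_f


# K_e = [2k_F(d1, bonding) - 2k_F(d1, antibonding)] / 4, in the same units.
K_e = Fraction(TWO_KF["d1_bonding"] - TWO_KF["d1_antibonding"], 4)

print(K_e, composite_momentum("d1_bonding"), composite_momentum("d1_antibonding"))
```

Under these assumptions the bonding and antibonding composites sit at $+K_e$ and $-K_e$ respectively, with $K_e = 2.5 \times 2\pi/48$, and the two $d_1$ band fillings add up to the total particle number of 32.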

The corresponding density-density and spin-spin structure fac- 
tors, displayed in Fig. 5b and c, also show nontrivial behaviour. We 
expect the density-density structure factor to be sensitive to each 
parton configuration individually and thus have singular features 
at various ‘$2k_F$’ parton wavevectors (see refs 21, 23 and the Sup- 
plementary Information). In the DMRG data, the most 
noticeable features are at $q_y = 0$ and $q_x = 2k_{Fd_1}^{(0)}, 2k_{Fd_1}^{(\pi)}$, which allow 
us to read off directly the realized $d_1$ parton configuration (see Fig. 2). 
The lack of these features in the VMC data, as well as the lack of 
analogous features at $q_x = 2k_{Fd_2}$ in the DMRG data, can be under- 
stood within our gauge theory framework as presented in the Sup- 
plementary Information, where we also note that our wavefunction is 
only a caricature of the full theory. Finally, the spin-spin structure 
factor at $q_y = 0$ not only has a familiar, expected feature at $q_x = 2k_{Ff}$ 
coming from the spinon, but also remarkably contains a feature at 
$q_x = 2K_e$ that can be thought of as a ‘$2k_F$’ wavevector from the dom- 
inant ‘electron’ in Fig. 5a. All in all, as we further explain in the 
Supplementary Information, the DMRG measurements are consis- 
tent, even on a fine quantitative level, with being in a stable non- 
Fermi-liquid d-wave metal phase. 


Figure 6 | Evolution of singular wavevectors in the d-wave metal phase. At 
fixed J/t = 2 and varying K/t, we show the location of the dominant singular 
wavevector $K_e$ in the electron momentum distribution function (see Fig. 5a), 
as well as the wavevectors identified as $2k_{Fd_1}^{(0)}$ and $2k_{Fd_1}^{(\pi)}$ in the density-density 
structure factor (see Fig. 5b). These calculations were done with DMRG. 

We note that the wavevector $K_e$ depends on the interaction strength 
K/t because the wavevectors $k_{Fd_1}^{(0)}$ and $k_{Fd_1}^{(\pi)}$ vary with ring exchange. 
In Fig. 6, we show at J/t = 2 the evolution with K/t of the wavevector $K_e$, 
that is, the location of the sharp steps in the electron momentum 
distribution function (see Fig. 5a), as determined by DMRG. Given 
that the momentum-space ‘volume’ enclosed by these singular fea- 
tures depends on the interaction K/t and is not simply determined by 
the total density of electrons, we may confidently say that the d-wave 
metal violates Luttinger’s volume theorem. In fact, the very notion of 
a single ‘Fermi surface’ is actually ambiguous in the d-wave metal 
phase. We also show in Fig. 6, for those values of K/t at which they 
are discernible, the wavevectors $2k_{Fd_1}^{(0)}$ and $2k_{Fd_1}^{(\pi)}$, as identified by fea- 
tures in the DMRG-measured density-density structure factor at 
$q_y = 0$ (see Fig. 5b). For all points, the locations of the identified features 
satisfy the nontrivial identity $K_e = [2k_{Fd_1}^{(0)} - 2k_{Fd_1}^{(\pi)}]/4$, as predicted by 
our theory. 

A remarkable property of the d-wave metal state found in the 
DMRG is that it has prominent critical d-wave Cooper pairs residing 
on the diagonals, as anticipated earlier from the ring energetics (see 
the Supplementary Information). Such Cooper pair correlations have 
the slowest power-law decay of all the discussed observables, includ- 
ing the electron Green’s function. This is in stark contrast with a 
conventional metal and suggests that the d-wave metal phase has 
some incipient d-wave superconductivity in two dimensions. 

As a final piece of evidence that the realized DMRG phase is in fact 
the d-wave metal, we have measured the number of 1D gapless modes, 
that is, the effective central charge c, via scaling of the bipartite entan- 
glement entropy in the DMRG and VMC wavefunctions. As 
explained above, we expect c = 2 in the conventional Luttinger liquid 
and c = 3 in the d-wave metal. See the Supplementary Information for 
a detailed comparison of the DMRG and VMC entropy measurements, 
where we show that the DMRG-VMC agreement is just as impressive 
as it is for the more traditional measurements of Fig. 5. The effective 
central charge versus K/t at J/t = 2 as determined by the DMRG is 
shown in Fig. 7. Indeed, these measurements indicate that c ≈ 2 in 
the conventional one-band metal, whereas c ≈ 3 in the exotic d-wave 
metal. Because c = 3 > 2, our putative d-wave metal phase clearly can- 
not be understood as an instability out of the conventional one-band 
metal, but also, because c = 3 < 4, the critical bonding and antibonding 
electrons in Fig. 5a cannot be reproduced by any perturbative treatment 
starting from free electrons (see also the Supplementary Information). 


Figure 7 | Central charge c as a function of interaction K/t. By measuring the 
von Neumann entanglement entropy $S_1$ in the DMRG, we calculate the 
effective central charge c at fixed J/t = 2 and varying K/t. There is a dramatic 
jump from c ≈ 2 to c ≈ 3 at the transition, as predicted by our theory. Data for 
two example points (K/t = 0.8 and 1.8) are shown in the inset, where X is the 
number of rungs in each bipartition. (See also the Supplementary Information.) 
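
One common way to extract an effective central charge from entanglement data is a least-squares fit to the standard conformal-field-theory scaling form for a periodic system, $S(X) = (c/3) \ln[(L_x/\pi) \sin(\pi X/L_x)] + A$. The sketch below assumes that form; the fitting procedure actually used is described in the Supplementary Information and may differ. The entropy values are synthetic, generated with c = 3 purely to exercise the fit, and are not data from this study.

```python
import math


def chord_log(x_rungs, l_x):
    """ln[(L_x/pi) sin(pi X / L_x)], the scaling variable for periodic boundaries."""
    return math.log((l_x / math.pi) * math.sin(math.pi * x_rungs / l_x))


def fit_central_charge(x_values, s_values, l_x):
    """Least-squares fit of S = (c/3) * chord_log + A; returns (c, A)."""
    u = [chord_log(x, l_x) for x in x_values]
    n = len(u)
    u_mean = sum(u) / n
    s_mean = sum(s_values) / n
    slope = (sum((ui - u_mean) * (si - s_mean) for ui, si in zip(u, s_values))
             / sum((ui - u_mean) ** 2 for ui in u))
    return 3.0 * slope, s_mean - slope * u_mean


# Synthetic entropies for a 48-rung ladder (illustrative only, not measured data).
L_x = 48
X_values = list(range(4, 25))
c_true, offset = 3.0, 0.7
S_values = [(c_true / 3.0) * chord_log(X, L_x) + offset for X in X_values]

c_fit, A_fit = fit_central_charge(X_values, S_values, L_x)
print(round(c_fit, 6), round(A_fit, 6))
```

With real DMRG entropies, the smallest bipartitions are often excluded from the fit because short-distance corrections to the scaling form are largest there.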


Discussion and outlook 


Here we have presented strong evidence for the stability of a two-leg 
descendant of our exotic strange-metal phase, the d-wave metal, and 
we conclude with an outlook on exciting future work. Firstly, our 
present two-leg d-wave metal treatment is readily extendable to sys- 
tems with more legs. Ref. 24 established the stability of the d-wave 
Bose metal, the main ingredient of the d-wave metal, on three- and 
four-leg ladders. Thus, we do not envision any conceptual obstacles in 
the way of realizing a similar result for the d-wave metal. However, we 
do anticipate that adding more legs will be very challenging numeri- 
cally for the DMRG owing to the large amount of spatial entangle- 
ment present in the d-wave metal and Fermi liquid; this is also the 
current limitation preventing modern 2D tensor network state meth- 
ods from attacking such problems. 

With the goal of connecting to experiments, it would be desirable to 
perform a detailed energetics study of the t-J-K model in two dimen- 
sions and explore the applicability of such models to strongly corre- 
lated materials. By studying 2D variational wavefunctions based on the 
d-wave metal, it should be possible to compare physical properties with 
experimentally observed strange metals, such as that in the copper 
oxides. This could include various instabilities of the d-wave metal, 
such as spinon pairing as a model of a pseudogap metal or chargon 
pairing as a model of an ‘orthogonal metal’, discussed recently. The 
d-wave sign structure already inherent in the non-superconducting 
parent d-wave metal suggests that there may be incipient d-wave 
superconductivity of the copper-oxide variety, which is particularly 
exciting. Although we have stressed its Luttinger volume violation as 
a characteristic non-Fermi liquid property of the d-wave metal, we note 
that the 2D phase will also have no Landau quasiparticle as well as 
exhibit non-Fermi-liquid-like thermodynamics and transport. Com- 
paring these predictions with properties of real strange metals would be 
interesting. In the end, however, we stress the conceptual nature of the 
present study, and hope that our ideas may open up new avenues for 
thinking about non-Fermi liquid electronic fluids. 


Whereas large-scale efforts have rapidly advanced the understanding and practical impact of human genomic variation, 
the practical impact of variation is largely unexplored in the human microbiome. We therefore developed a framework for 
metagenomic variation analysis and applied it to 252 faecal metagenomes of 207 individuals from Europe and North 
America. Using 7.4 billion reads aligned to 101 reference species, we detected 10.3 million single nucleotide 
polymorphisms (SNPs), 107,991 short insertions/deletions, and 1,051 structural variants. The average ratio of 
non-synonymous to synonymous polymorphism rates of 0.11 was more variable between gut microbial species than across 
human hosts. Subjects sampled at varying time intervals exhibited individuality and temporal stability of SNP variation 
patterns, despite considerable composition changes of their gut microbiota. This indicates that individual-specific 
strains are not easily replaced and that an individual might have a unique metagenomic genotype, which may be 
exploitable for personalized diet or drug intake. 


With the increasing availability of individual human genomes, various 
theoretical and practical aspects of genomic variation can be deduced 
for individuals and the human population as a whole. Like sequenced 
human genomes, the number of human gut metagenomes (currently 
mostly derived from Illumina shotgun sequencing of stool samples) is 
increasing exponentially. Given the importance of the gut microbiota 
in human health and a growing number of studies reporting associa- 
tions between gut microbiota and diseases, an understanding of 
genomic variation in gut microbial populations will probably trigger 
applications towards human well-being and disease. 

For example, in the common gut commensal bacterium Escherichia 
coli, just three point mutations in two genes can confer clinically 
relevant antibiotic resistance’, and natural variation in a single gene 
can lead to pathogenic adaptation®. Even within pathogenic species in 
the gut, closely related coexisting strains can exhibit different patho- 
genic potentials due to minor genomic variation’. These examples 
illustrate how genomic variation within gut microbes could confer 
phenotypes that require personalized care or treatment of the host. 

Studies based on 16S ribosomal RNA gene surveys or whole meta- 
genome shotgun sequencing characterized taxonomic and functional 
compositions of healthy individuals’ and intestinal bowel disease 
patients’ gut microbiota at the genus or species level. Variation in 
taxonomic abundance as well as functions encoded by these gut micro- 
biota have been described between individuals and used to stratify 
individuals according to their gut community compositions into 
enterotypes. However, genomic variation within species, which leads 
to their phenotypic diversity and adaptations to different environ- 
ments, has only been studied in a few taxa, such as Citrobacter spp. 

An early landmark study on a small data set described metage- 
nomic variation in an acidic biofilm microbiome of low complexity. 
The population structure for one species in that habitat was studied 
and positive selection was observed in some genes. Another recent 
study resolved multiple clinical isolates of methicillin-resistant 
Staphylococcus aureus and delineated its epidemiology and micro- 
evolution based on genomic variation. With the availability of 
hundreds of deeply sequenced human gut metagenomes, suf- 
ficient data are becoming available for quantitative analyses of the 
genetic structure of complex microbial communities, allowing the 
study of many species at the same time. 

Here we analysed 1.56 terabases of sequence data from 252 stool 
samples from 207 individuals (Supplementary Table 1 and Sup- 
plementary Notes) obtained from the MetaHIT project (71 Danish, 
39 Spanish; all sampled once), the NIH Human Microbiome Project 
(94 US samples; 51 individuals sampled once, 41 sampled twice and 2 
sampled three times), and Washington University (three US samples; 
all sampled once). Our goals were to: (1) develop a framework for 
genomic variation analysis using metagenomic shotgun data; (2) 
gather basic knowledge on the genomic variation landscape in gut 
metagenomes; and (3) gain insights into the individuality, temporal 
stability and biogeography of metagenomic variation. 


Framework for metagenomic variation analysis 


We used 1,497 prokaryotic genomes to generate a set of reference 
genomes (Supplementary Table 2) for the analysis of genomic vari- 
ation in gut microbial species in 252 samples (on average 6.2 ± 4.1 
gigabases (Gb) were analysed). Pairwise comparisons of 40 universal 
marker genes identified in these genomes were performed to 
create a set of 929 clusters based on a 95% DNA identity threshold 
recommended for identifying species. The genome recruiting the 
highest number of reads in a cluster was selected as reference for that 
species (see Methods and Supplementary Information). 

Using the same 95% identity threshold, we mapped 7.4 billion 
metagenomic reads (42% of the total, 91% thereof uniquely) with 
an average length of 80 base pairs (bp) to the 929 reference genomes 
(Supplementary Tables 1 and 3). To avoid mapping artefacts (for 
example, caused by high coverage of prophages), we required ≥40% 
of each reference genome to be covered by reads (corresponding to 
the gene content similarity between two strains of E. coli). The 
resulting 101 prevalent species with base-pair coverage from 12× to 
32,400× (Fig. 1 and Supplementary Fig. 1) were subjected to genomic 
variation analysis. 

To enable comparative analyses in multiple metagenomes and to 
identify low-frequency variants not detected when analysing them 
individually, we used multi-sample calling to identify SNPs, short 
insertions/deletions (indels; 1-50 bp) and structural variations (>50 bp) 
in each genome, although structural variations were largely under- 
estimated due to small insert sizes (Supplementary Information). We 
only called variants with allele frequency ≥1% (the conventional 
definition of polymorphism) and supported by ≥4 reads. False-pos- 
itive rates were estimated at 0.71% for SNPs and 1.04% for structural 
variations (Supplementary Information, Supplementary Tables 4 and 
5, and Supplementary Fig. 2). 
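
The two calling thresholds stated above (allele frequency of at least 1% and support from at least 4 reads) can be written as a simple predicate. The sketch below is a minimal illustration under stated assumptions: the function name, the per-site read counts and the idea of pooling counts across samples before filtering are introduced here for illustration, and the actual multi-sample caller is described in the Supplementary Information.

```python
def passes_filter(alt_reads, total_reads, min_freq=0.01, min_support=4):
    """True if a candidate variant meets both the frequency and read-support cut-offs."""
    if total_reads <= 0:
        return False
    return alt_reads >= min_support and alt_reads / total_reads >= min_freq


# Hypothetical pooled counts per site: (reads supporting the variant, total reads).
candidates = {
    "site_A": (4, 100),     # 4% frequency, 4 reads: kept
    "site_B": (3, 100),     # enough frequency but only 3 supporting reads: dropped
    "site_C": (4, 1000),    # 4 reads but only 0.4% frequency: dropped
    "site_D": (50, 1000),   # 5% frequency, 50 reads: kept
}

called = [name for name, (alt, total) in candidates.items() if passes_filter(alt, total)]
print(called)
```

Requiring both conditions means that at low coverage the read-support threshold dominates, whereas at very high pooled coverage the 1% frequency threshold becomes the stricter of the two.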


Genomic variation in prevalent gut species 

We identified 10.3 million SNPs in 101 genomes (3.1% of the total 
329 Mb) across 252 samples from 207 subjects, almost as many as 
the 14.4 million SNPs recently identified in 179 human genomes. 
Within an individual the rate was lower (on average 1.21%, see Sup- 
plementary Table 6), yet SNPs kb⁻¹ increased with base-pair coverage 
when samples were pooled (Supplementary Fig. 3). We also identified 
107,991 indels and 1,051 structural variants in these 101 species 
(Supplementary Information). Their relative ratios to SNPs (10,485 
short indels and 102 structural variants per million SNPs) were robust 
across species and individuals (Supplementary Fig. 4). Subsequent 
analyses were restricted to SNPs due to their orders of magnitude 
higher count over other variation types. 
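
The summary ratios in this paragraph can be reproduced from the quoted totals. The sketch below uses the rounded figures given in the text (10.3 million SNPs, 329 Mb of reference sequence), so small deviations from the exact underlying counts are possible; the variable names are illustrative.

```python
snps = 10_300_000            # "10.3 million SNPs" (rounded figure from the text)
indels = 107_991
structural_variants = 1_051
genome_bp = 329_000_000      # "329 Mb" of reference sequence in the 101 genomes

snp_percent = 100 * snps / genome_bp
indels_per_million_snps = indels / snps * 1_000_000
svs_per_million_snps = structural_variants / snps * 1_000_000

print(round(snp_percent, 1), round(indels_per_million_snps), round(svs_per_million_snps))
```

The rounded results agree with the 3.1%, 10,485 and 102 reported above.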

We annotated the genes of the prevalent genomes using orthologous 
groups from eggNOG (Supplementary Information) and found that 
the orthologous groups with the highest SNP density were enriched in 
functions related to conjugal transfer of antibiotic resistance (Sup- 
plementary Table 7). For example, the orthologous group with the 
highest average SNP density across samples was the clindamycin resis- 
tance transfer factor btgA (NOG119724), required for conjugative 
transmission of plasmids. Mutations commonly accrued from the 
process of conjugation may account for increased diversity among 
conjugation-associated functions. Additionally, CRISPR-associated 
proteins, responsible for conferring resistance in bacteria, were also 
found among the orthologous groups with high SNP densities (Sup- 
plementary Information and Supplementary Table 7). 

The large number of SNPs provided the opportunity to compare, 
for the first time at such scale, the evolution of different coexisting 
species across a large cohort of individuals. To evaluate selective con- 
straints in these species in their natural habitat, we estimated the ratio 
of non-synonymous to synonymous polymorphism rates**”° (pN/pS) 
within each species in every sample (Fig. 1 and Supplementary 
Information). pN/pS characterizes selective constraint at the level of 
a population contrary to the more commonly used dN/dS that charac- 
terizes it between individual species”*. To validate pN/pS ratios, we 
estimated genetic variation using the sample-size-independent nucleotide
diversity π, and found that π is highly correlated with SNPs kb⁻¹
(Fig. 1 and Supplementary Fig. 5). The derived measures of π(N)/π(S)
and π(non-degenerate sites)/π(fourfold degenerate sites), the latter of
which is less dependent on specific properties of mutation spectra such
as transition and transversion ratios, were coherent with pN/pS
(Supplementary Fig. 6).

Figure 1 | Genomic variation statistics for 101 gut microbial species
prevalent in 252 samples from 207 individuals. Genomic variation statistics
were calculated for 101 prevalent gut microbial species, operationally defined as
having ≥10× cumulative (over all samples) base-pair coverage with at least one
sample exhibiting a genome coverage of ≥40%. The 66 dominant species
(indicated by an asterisk), which account for 99% of the mapped reads, were
used for analyses that required high base-pair coverage. Species names are given
without strain specifications unless this would result in duplicate entries. The
blue point cloud plots show the coverages (≥1×) in all samples, with the blue
dot above indicating the cumulative coverage and the red dot the maximum
coverage across all samples. Grey shaded areas indicate the level of base-pair
coverage at which abundance effects have only minor effects on SNPs kb⁻¹ and
pN/pS ratios of the pooled samples (Supplementary Information). SNP counts
appear to saturate at approximately 500×, with minor increases at higher
coverages probably due to the sampling of rare variants at low rates. In
individual samples, pN/pS is largely stable from a coverage of 10× onward
(Supplementary Fig. 7), corresponding to approximately 200× cumulative
coverage in our sample set. Nucleotide diversity π follows SNPs kb⁻¹ closely, as
does the derived measure of π(N)/π(S) with respect to pN/pS.
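
As a rough illustration of the two statistics discussed here, the following Python sketch computes a pN/pS ratio and a per-site nucleotide diversity (π) from simple counts. It is a minimal sketch under stated assumptions, not the pipeline used in the study, whose exact estimators are given in the Supplementary Information. The function names, the per-site normalization of pN and pS, and the pairwise-difference form of π are illustrative choices.

```python
from typing import Sequence


def pn_ps(nonsyn_snps: int, syn_snps: int,
          nonsyn_sites: float, syn_sites: float) -> float:
    """Ratio of non-synonymous to synonymous polymorphism rates.

    Assumption: each rate is the SNP count divided by the number of
    sites at which such a change could occur.
    """
    if syn_snps == 0 or syn_sites <= 0 or nonsyn_sites <= 0:
        raise ValueError("pN/pS is undefined without synonymous SNPs and sites")
    pn = nonsyn_snps / nonsyn_sites
    ps = syn_snps / syn_sites
    return pn / ps


def nucleotide_diversity(allele_counts: Sequence[Sequence[int]],
                         genome_length: int) -> float:
    """Average number of pairwise differences per site.

    allele_counts holds, for each variable site, the read counts of the
    alleles observed there (hypothetical input format). Sites not listed
    are treated as invariant.
    """
    total = 0.0
    for counts in allele_counts:
        n = sum(counts)
        if n < 2:
            continue  # a single read carries no pairwise information
        # Probability that two reads drawn without replacement differ.
        same_pairs = sum(c * (c - 1) for c in counts)
        total += 1.0 - same_pairs / (n * (n - 1))
    return total / genome_length
```

Drawing read pairs without replacement is one common way to keep the estimate from depending on sample size, which matches the property the text attributes to π.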

The pN/pS ratio of a genome within a sample remained stable at
coverages higher than 10× (Supplementary Fig. 7), yet another
indication of few false SNP calls, and was on average 0.11, but varied
considerably for different species (0.04 to 0.58) in accordance with 
dN/dS ratios estimated independently in a number of interspecies 
comparisons between closely related bacteria and archaea”””®. 


pN/pS across gut species and individuals 


Because meaningful comparison of genomic variation requires 
both breadth (across samples) and depth (in number of base pairs) 
of sequencing, we focused on the 66 most dominant species that 
attracted >99% of the reads (Fig. 1). Their relatively low pN/pS ratios 
were constant across different hosts (Fig. 2a and Supplementary Table 
8), which may indicate similar selective constraints across individuals. 
Thus, the evolution of gut species is probably dominated by long-term 
purifying selection and drift rather than rapid adaptations to specific 
host environments. The wide range of these ratios across species may 
suggest that different gut species face different evolutionary constraints. 

To investigate how different gut species respond to the pressure 
from the gut environment, we compared the pN/pS ratios of indi- 
vidual genes in Roseburia intestinalis and Eubacterium eligens, which 
differed considerably in their overall mean pN/pS ratios (0.236 
(R. intestinalis) versus 0.131 (E. eligens) from 106 and 147 samples, 
respectively) despite having comparable average base-pair coverages 
(Supplementary Information). Whereas 75% of the genes in R. intes- 
tinalis had systematically higher pN/pS ratios compared to their 
orthologues in E. eligens, a few others showed considerable deviations
(Fig. 2b and Supplementary Table 9), indicative of differing evolu- 
tionary constraints for these genes. For example, galK, the gene encoding 
galactokinase, an essential enzyme in the Leloir pathway for galactose 
metabolism in most organisms”, was among the lowest in terms of 
pN/pS ratio in R. intestinalis but among the highest in E. eligens (0.03 
and 0.48, respectively; Fig. 2b, c). Although present in E. eligens, this 
gene may not exert its main function (see also ref. 30), as E. eligens 
cannot ferment galactose, nor the galactose-containing disaccharides 
lactose and melibiose*!. On the other hand, R. intestinalis is known to 
ferment melibiose*’, indicating that its galK gene is functional (Sup- 
plementary Information). Thus, the same gene can be under tight 
negative selection in one species but under more relaxed negative 
selection in another. 

Our framework allowed us additionally to obtain information on 
all genes in each sample (Supplementary Information). As expected, 
we found that housekeeping genes usually had lower pN/pS ratios. For
example, the DNA-dependent RNA polymerase β-subunit gene was
consistently among the genes exhibiting the lowest pN/pS ratios 
across samples and species (Supplementary Table 10). Less obvious 
examples included genes related to type IV secretion systems used to 
transfer DNA between microbes* and involved in host interactions 
of both pathogenic*? and commensal bacteria”, specifically in anti- 
inflammatory responses and immune modulation**. The low pN/pS 
ratio of genes related to type IV secretion systems suggests that main- 
taining genome plasticity and antibiotic resistance through conjuga- 
tive transposition is essential in the constantly changing environment 
of the gut and that the interaction with the host immune system is 
under purifying selection (Supplementary Table 10). We also found 
a few conserved unknown, but apparently gut-microbe-specific, 
proteins that exhibited low pN/pS ratios, suggesting that they perform 
important yet hitherto unexplored functions (Supplementary Table 
11). 

Among the genes or orthologous groups with consistently the high- 
est pN/pS ratios were many transposases and antimicrobial resistance 
genes including the gut-specific gene bile salt hydrolase (BSH)’° 
(Supplementary Table 10). Conjugated bile acids (CBAs) secreted
by the hosts repress microbial growth and upregulate the host mucosal
defence system. BSHs are involved in the initial reaction in the meta- 
bolism of CBAs by gut microbes”. Their high pN/pS ratio may be 
indicative of the genomic plasticity necessary to metabolize and survive 
the variety of different bile acids present in the gut”. 


Temporal stability of individual SNP patterns 


Several studies on adult human gut microbial samples from a few 
individuals have suggested that within-individual differences over time 
are smaller compared to between-individual differences in microbial 
species composition and abundance**’. Within a larger cohort, indi- 
viduality of host-associated microbiota has been reported on the basis 
of 16S rRNA gene profiling of fingertip-associated communities”, 
whereas other studies on a few samples have investigated the persistence
of specific strains over time’. However, intra-species variation at 
nucleotide resolution at the whole-genome level and accompanying 
changes in species abundances within the human gut over long time 
periods (>1 year) have not been studied yet in large cohorts. It is
unclear if the concept of resident strains is common to other prevalent 
species, if host-specific strains are retained over time, and how fast they 
evolve inside the gut environment. 

To explore these questions, we used 88 gut metagenomes from 
43 healthy US subjects (a subset of our cohort) from whom at least 
two samples were obtained at different time points with no antibiotic
treatment in between (Supplementary Table 12). To measure how
similar the subpopulations (strains) of the dominant species were
between two samples, we estimated the fixation indices (FST) between
the populations (Supplementary Information). Because this measure
depends on allele frequencies, which cannot be determined accurately
at low base-pair coverage, we also estimated the fraction of alleles shared
between the samples out of all polymorphic sites (only 49 genomes that
accrued 40% genome coverage in at least two samples were used and
genomes with >10× base-pair coverage were down-sampled to 10×;
Supplementary Information and Supplementary Fig. 3). Because the 
fraction of shared SNPs depends on the number of variable sites, we 
developed a heuristic allele sharing similarity score that takes into 
account both the number of variable sites and the fraction of shared 
alleles (Supplementary Information). 
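
To make the fixation index concrete, the sketch below computes FST between two samples from per-site allele frequencies. This is a generic textbook form, (between-sample diversity minus within-sample diversity) divided by between-sample diversity, for biallelic sites. The estimator actually used in the study is described in its Supplementary Information and may differ, and the function name and input format here are hypothetical.

```python
from typing import Sequence


def fst(freqs_a: Sequence[float], freqs_b: Sequence[float]) -> float:
    """FST between two samples from allele frequencies at shared sites.

    freqs_a[i] and freqs_b[i] are the frequencies of the same allele at
    site i in samples A and B (biallelic sites assumed).
    """
    within = 0.0
    between = 0.0
    for p, q in zip(freqs_a, freqs_b):
        # Mean heterozygosity within each sample.
        within += (2 * p * (1 - p) + 2 * q * (1 - q)) / 2
        # Probability that one allele drawn from each sample differs.
        between += p * (1 - q) + q * (1 - p)
    if between == 0:
        return 0.0  # no variation at all, so no differentiation
    return (between - within) / between
```

Identical frequency profiles give 0 and fixed differences give 1, so a lower value indicates more similar strain populations.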

When we compared all 252 samples, FST was significantly lower
and allele sharing significantly stronger between different samples
from the same individuals than between samples from different individuals
(Mann-Whitney U-test: P < 0.001 for both; see Fig. 3a, b and
Supplementary Information). The same trend was observed, albeit 
much weaker, based on species compositions (Fig. 3c), in line with 
previous observations from microbial composition-based results**°. 
Intra-individual variation being smaller than inter-individual vari- 
ation does not require that samples from the same individual are more 
similar to each other than to any other sample in the tested cohort. 
Our results showed for both measures of variation similarity that all 
but one of the 88 multi-time-point samples had the highest similarity 
to another sample from the same individual, which was not true when 
comparing species abundance over time (Fig. 3c and Supplementary 
Information). This indicates that species abundance in gut micro- 
biota cannot serve as a fingerprint of an individual whereas variation 
patterns might. 
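
The "most similar sample" test described above can be sketched as a nearest-neighbour check over a pairwise similarity matrix. The code below is an illustration under assumptions: the matrix layout, the label list and the function name are hypothetical, and any similarity measure where higher means more alike can be plugged in (for a distance such as FST, negate the values first).

```python
from collections import Counter
from typing import Sequence, Tuple


def nearest_neighbour_hits(similarity: Sequence[Sequence[float]],
                           individuals: Sequence[str]) -> Tuple[int, int]:
    """Count samples whose most similar other sample is from the same person.

    similarity[i][j] is the similarity between samples i and j;
    individuals[i] is the donor of sample i. Only samples from donors
    with at least two samples are tested. Returns (hits, tested).
    """
    n = len(individuals)
    samples_per_donor = Counter(individuals)
    hits = 0
    tested = 0
    for i in range(n):
        if samples_per_donor[individuals[i]] < 2:
            continue  # no second time point to match against
        best = max((j for j in range(n) if j != i),
                   key=lambda j: similarity[i][j])
        tested += 1
        if individuals[best] == individuals[i]:
            hits += 1
    return hits, tested
```

A result of hits equal to tested would mean every multi-time-point sample is closest to another sample from the same donor, which is the fingerprint-like behaviour the text reports for variation patterns but not for species abundance.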

We also tested whether differences in FST and allele sharing decreased
over time, which may indicate a divergence of the strains or a hori- 
zontal transmission of strains from the environment; however, the 
individual-specific variation patterns remained stable over all of the 
time intervals monitored (Fig. 3). Although this stability should be 
verified for longer periods as well as when antibiotic treatment or other 
gut microbiota-challenging events have taken place, our observation 
indicates that healthy human individuals retain specific strains (see 
also Supplementary Tables 12 and 13) for at least 1 year. 

In contrast to the strong evidence for individuality and temporal
stability of SNP patterns, we did not observe a significant geographical
separation between European and US samples (Supplementary Fig. 8
and Supplementary Table 14). This indicates that long-term horizontal
transmission of at least some dominant gut microbial strains
causes geographical mixing over time. The strongest continental separation,
based on FST, was seen in Bacteroides coprocola (Fig. 4), which
was also the only genome with sufficient amounts of data that showed
continental separation based on the allele sharing score (Supplementary
Fig. 8 and Supplementary Tables 14 and 15).

Figure 2 | pN/pS ratios of 66 dominant species reveal more variation
between species than between individuals. a, A heat map of pN/pS ratios for
the 66 dominant species (rows) and 207 individuals (columns; only the first
time-point per individual) is shown and summarized by species (box plots on
the right). Rows and columns are ordered by their mean pN/pS ratios, which
vary considerably between species, but have a tighter bandwidth across
samples. Two genomes that are exceptions to this trend (indicated by an
asterisk) might indicate higher strain diversity. The panel above the heat map
indicates the continent of residence for each individual. A significant difference
was found in the mean pN/pS ratios between the two continents, although this
is probably an effect of lower sequencing depths of European samples
(Supplementary Table 8) that leads to missing data points in some samples (see,
for example, top-right corner). b, The distributions of average pN/pS ratios of
individual genes from Roseburia intestinalis and Eubacterium eligens (both
highlighted in a) illustrate that, although base-pair coverages are similar, the
pN/pS ratio of R. intestinalis is higher in general. The relative pN/pS ratios of
orthologous groups in the two species are shown in the inset, with the average
log2 ratio indicated by the solid line and the random expectation by the dashed
line. Outliers can be revealed this way, like the galactokinase gene (galK), the
pN/pS of which is among the lowest in R. intestinalis and the highest in
E. eligens. c, Illustration of low and high pN/pS ratios in galK genes from
R. intestinalis (top panel) and E. eligens (bottom panel). The cumulative read
coverage is shown in grey with synonymous (green) and non-synonymous
(brown) changes marked at the nucleotide positions they occur.


Discussion 

We have established a framework for gut microbial genomic variation 
analysis using metagenomic data and identified in a single analysis— 
involving 252 stool samples from 207 human individuals—almost as 
many SNPs in the human gut microbiome as the 1000 Genomes 
Project recorded in 179 human individuals over several years”. The 
stable pN/pS ratios of gut microbial species across individuals suggest 
that host conditions (such as diet, genetic differences, and immune
tolerance) have a minor influence on the evolution of species compared
to constraints common to the human population (such as
gut physiology, anaerobic conditions and pH). In the 66 dominant
species, the analysis of more than 229,000 genes comprising about
8,000 orthologous groups pinpointed consistently fast or slow evolving
genes across individuals (Supplementary Table 10). However, further
studies are needed to interpret different selection types at the gene level.

Figure 3 | Individuality and temporal stability of genomic variation
patterns. a-c, Samples from 43 individuals that were sampled at different time
intervals (red dots) were compared with the most similar sample from a
different individual (blue dots) in terms of population similarity that takes allele
frequencies into account (a), allele sharing similarity score that takes SNP
counts and the ratio of shared SNPs into account (b) (Supplementary
Information) and species abundance similarity measured using the Jensen-Shannon
distance’ (c). The most similar sample is the one with the lowest FST
value in a, the highest allele sharing similarity score in b and the lowest
Jensen-Shannon distance in c. The three similarity measures are plotted against the
number of days between the sampling time points. The mean across all
intra-individual, best inter-individual and all inter-individual similarities is shown as
red, blue and grey lines, respectively. For both population similarity and allele
sharing similarity between samples from the same individual, all but one
sample (resulting in two outliers due to comparisons with two other
time-points, see Supplementary Table 12) shared the highest similarity with another
sample of the same individual, providing strong evidence for individuality of
SNP sharing patterns. No decline of similarity over time could be observed.

Figure 4 | Inter-continental comparison of gut microbial species.
Between-continent FST values for eight genomes with ≥10 samples representing each
continent are shown. Bacteroides coprocola was the species with the highest FST
value, implying a separation between the B. coprocola populations in Europe
and North America (see also Supplementary Information; all data are available
in Supplementary Table 14).

The availability of time-point data revealed that individual-specific
variation patterns were remarkably stable over time (Fig. 3a, b), which
was much less the case for similarities in species abundance: for
almost 60% of the samples, a sample from a different individual was 
the most similar (Fig. 3c). Thus, the metagenomic variation patterns 
observed here support the hypothesis that a healthy individual retains 
specific strains for extended periods of time. This suggests that each 
individual has a metagenomic variation profile that could be unique 
even in very large cohorts. It should be noted that the maximal sampling 
period was only 1 year, and 43 individuals might not be sufficient to 
trace horizontal transmissions of strains. The likelihood of the latter is 
supported by the apparent absence of clear continental stratification 
(despite different sampling and sequencing protocols of the European 
and the US samples), although for one out of eight species analysed, 
Bacteroides coprocola, we provide preliminary evidence of such stratification (Fig. 4 and
Supplementary Fig. 8). Geographical stratification has been described 
for Helicobacter pylori***, and weak but detectable signals have also 
been observed in some bacterial pathogens***’. Thus, we expect more 
gut microbial stratification patterns to emerge when larger data sets 
under standardized sampling and sequencing protocols become avail- 
able, although it remains to be tested which factors (such as geographi- 
cal separation, diseases, host-genetic and life-style/diet factors) shape 
the distribution of gut microbial strains and segregating SNPs within 
the population. The absence of clear geographical stratification implies 
that stable differences in variation patterns of gut species are not 
explained by large-scale structures of local microbial populations. 
They may rather be a result of genetic drift due to population bottle- 
necks that could occur not only during the colonization of the infant 
gut but also by processes causing community shifts during adult life 
stages, followed by a rapid population growth accompanied by purifying 
selection. This model suggests that the source of genetic variation in 
human gut microbial populations is less likely to be new mutations 
within the host than the variation in the initial colonizing populations 
or transmissions from the environment. This would imply that most 
allelic variants analysed in this study segregate at timescales greatly 
exceeding human generation time. 

The introduction of large-scale variation analysis in metagenomic 
data of complex communities and the discovery of individual meta- 
genomic variation profiles open up several applications. It is now 
possible to screen in silico for many pathogenic or antibiotic-resistance
variants in the population. Once a sample has been analysed, the
data can also be used in the future given the temporal stability of SNP 
profiles. As it took years to identify marker genes and variations for 
diseases or phenotypes in the human genome, the variation landscape 
uncovered here can only be seen as a starting point for finding molecular
biomarkers, including particular variants that reveal useful information
for human health and well-being.


METHODS SUMMARY 

Mapping to non-redundant genomes. A reference genome set representing 929 
species was derived from a total of 1,497 published prokaryotic genomes, based 
on a median sequence identity of 95% in 40 universal, single-copy marker 
genes'*””. Metagenomic reads from 252 samples were aligned to these 929 genomes 
using the same 95% sequence identity cutoff. 

Coverage. For each genome, we calculated the sample-specific base-pair coverage 
and the number of bases of the genome covered by at least one read. For a genome 
to be considered we required a cumulative depth of coverage of ≥10× across all
samples. To remove species that are not present in our cohort, yet attract reads due 
to highly conserved regions, we required at least 40% breadth of the genome 
coverage (the criterion for the species to be considered present) from at least 
one sample. 
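
The two coverage criteria above can be expressed as a simple filter. The following Python sketch uses the thresholds stated in the text (cumulative depth of at least 10× and at least 40% breadth in one or more samples); the function name and the per-sample input lists are hypothetical and only illustrate the logic.

```python
from typing import Sequence


def passes_coverage_filter(depths: Sequence[float],
                           breadths: Sequence[float],
                           min_cumulative_depth: float = 10.0,
                           min_breadth: float = 0.40) -> bool:
    """Decide whether a reference genome is kept for analysis.

    depths[i]   -- base-pair (depth of) coverage of the genome in sample i
    breadths[i] -- fraction of the genome covered by >=1 read in sample i
    """
    # Depth is accumulated over all samples.
    enough_depth = sum(depths) >= min_cumulative_depth
    # Breadth must be reached in at least one single sample, which guards
    # against genomes that only attract reads in conserved regions.
    present_somewhere = max(breadths, default=0.0) >= min_breadth
    return enough_depth and present_somewhere
```

Keeping the two tests separate mirrors the stated rationale: depth ensures enough data for SNP calling, whereas breadth is the criterion for a species being present at all.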

Variation detection. We performed SNP calling on the pooled samples and only 
considered bases with a quality score ≥15. We required SNPs to be supported
by ≥4 reads and to occur with a frequency of ≥1%. Structural variations were
detected using Pindel”*. False-positive rates in SNP and structural variation detec- 
tion were estimated using nonsense and frameshift mutations in 40 essential 
single copy marker genes. 
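
As an illustration of the SNP support thresholds, the sketch below applies the read-count and frequency cutoffs to the pooled base counts at one position. It assumes the counts have already been restricted to bases with quality score ≥15 and that frequency is measured against all pooled reads at the position. The function name and input format are hypothetical, and the actual calling procedure is described in the paper's supplementary material.

```python
from typing import Dict, List


def candidate_snps(base_counts: Dict[str, int],
                   reference_base: str,
                   min_reads: int = 4,
                   min_frequency: float = 0.01) -> List[str]:
    """Return non-reference bases that pass both support thresholds.

    base_counts maps each observed base to the number of pooled reads
    supporting it at this position (quality-filtered beforehand).
    """
    depth = sum(base_counts.values())
    if depth == 0:
        return []
    return sorted(
        base
        for base, count in base_counts.items()
        if base != reference_base
        and count >= min_reads
        and count / depth >= min_frequency
    )
```

Requiring both an absolute read count and a minimum frequency means that a variant seen four times is accepted at 100× pooled depth but rejected at 1,000×, where four reads are more plausibly sequencing errors.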

π and FST. We estimated nucleotide diversity (π) and fixation indices (FST) on the
basis of allele frequencies. 


3 JANUARY 2013 | VOL 493 | NATURE | 49 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


pN/pS ratio. SNPs occurring in coding regions were classified as synonymous or 
non-synonymous. Genes from the non-redundant genomes were annotated 
using eggNOG orthologous groups allowing calculation of pN/pS ratios at the 
level of genomes, orthologous groups and genes. 
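
The classification step can be illustrated with a small codon lookup. The sketch below builds the standard genetic code (whose codon assignments are shared with the bacterial translation table) and labels a single-base change as synonymous or non-synonymous. The function name and calling convention are hypothetical, and handling of multiple SNPs in one codon is deliberately left out.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def classify_snp(codon: str, position: int, alt_base: str) -> str:
    """Classify a single-base change within a codon.

    codon    -- reference codon, e.g. "CTT"
    position -- 0-based position of the SNP within the codon
    alt_base -- the variant base
    """
    mutated = codon[:position] + alt_base + codon[position + 1:]
    if CODON_TABLE[codon] == CODON_TABLE[mutated]:
        return "synonymous"
    return "non-synonymous"
```

In this sketch a change that creates a stop codon is also reported as non-synonymous, since the encoded residue changes.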

Pairwise sample comparisons. Similarity in strain populations between two 
samples was estimated using a similarity score based on shared SNPs and 
using FST.

CCR5 is a receptor for Staphylococcus
aureus leukotoxin ED 


Francis Alonzo III', Lina Kozhaya'*, Stephen A. Rawlings'*, Tamara Reyes-Robles'*, Ashley L. DuMont, David G. Myszka’*, 


Nathaniel R. Landau!, Derya Unutmaz>*? & Victor J. Torres? 


Pore-forming toxins are critical virulence factors for many bacterial pathogens and are central to Staphylococcus 
aureus-mediated killing of host cells. S. aureus encodes pore-forming bi-component leukotoxins that are toxic 
towards neutrophils, but also specifically target other immune cells. Despite decades since the first description of 
staphylococcal leukocidal activity, the host factors responsible for the selectivity of leukotoxins towards different 
immune cells remain unknown. Here we identify the human immunodeficiency virus (HIV) co-receptor CCR5 as a 
cellular determinant required for cytotoxic targeting of subsets of myeloid cells and T lymphocytes by the S. aureus 
leukotoxin ED (LukED). We further demonstrate that LukED-dependent cell killing is blocked by CCR5 receptor 
antagonists, including the HIV drug maraviroc. Remarkably, CCR5-deficient mice are largely resistant to lethal S. 
aureus infection, highlighting the importance of CCR5 targeting in S. aureus pathogenesis. Thus, depletion of CCR5+
leukocytes by LukED suggests a new immune evasion mechanism of S. aureus that can be therapeutically targeted. 


S. aureus is a bacterial pathogen that causes significant morbidity and 
mortality worldwide. The organism is responsible for a myriad of 
diseases, from skin and soft-tissue infections, to more invasive dis- 
eases including necrotizing pneumonia and sepsis. S. aureus secretes 
several protein products that allow the organism to subvert the host 
immune system. Such factors include super-antigens, antibody bind- 
ing proteins, cytolytic peptides and pore-forming cytotoxins’, 

Pore-forming toxins are secreted by a substantial number of patho- 
genic bacteria*. The toxins are secreted as water-soluble monomers 
that recognize host cell membranes, oligomerize, and insert α-helical
or β-barrel pores into the lipid bilayer’. Pore formation disrupts
osmotic balance and membrane potential, ultimately leading to cell 
death*. S. aureus strains that infect humans produce up to four 
different β-barrel, bi-component, pore-forming toxins (HlgACB,
LukED, LukSF-PV/PVL and LukAB/HG) that exhibit a unique tro- 
pism for host immune cells and contribute to the greater virulence 
of S. aureus'**. The precise repertoire of immune cells targeted by 
the pore-forming leukotoxins remains to be fully determined. Even 
now, more than a century since the first description of staphylococcal 
leukocidal activity”®, our understanding of leukotoxin function in vivo 
is limited because of an absence of known host-derived specificity 
determinants. 


CCR5 is required for LukED cytotoxicity

To identify potential leukotoxin receptors, we purified recombinant 
LukED, LukAB and LukSF-PV and assessed their ability to kill a set of
human cell lines*”. Granulocyte-like human cells (PMN-HL60) were 
killed in 1 h by LukAB and LukSF-PV, but not LukED (Fig. 1a). In
contrast, LukED was cytotoxic to a human T-cell line ectopically
expressing CCR5 (HUT-R5), whereas another T-cell line (Jurkat),
which lacks detectable CCR5, was insensitive (Fig. 1a). This suggested
that CCR5 was involved in LukED cytotoxicity towards HUT-R5 cells.
Accordingly, when CCR5 amounts were reduced in HUT-R5 cells
using lentiviral CCR5 short hairpin RNA (shRNA), the cells were
protected from LukED-mediated killing (Fig. 1b and Supplementary
Fig. 1a, b).

Complementary to these findings, ectopic expression of CCR5 
was sufficient to render Jurkat and H9 cells (Supplementary Fig. 1c) 
susceptible to LukED cytotoxicity (Fig. 1c). As expected, on the basis 
of the mode of action of the bi-component leukotoxins, CCR5- 
dependent LukED-mediated cytotoxicity required both LukE and 
LukD subunits (Supplementary Fig. 2a, b). A human osteosarcoma 
cell line engineered to constitutively express CCR5 (GHOST.R5 cells)®
was also sensitive to LukED, but not to LukAB or LukSF-PV (Fig. 1d). 
The sensitivity of GHOST cells to LukED was specific to CCR5 
expression, as overexpression of additional T-cell-specific chemokine 
receptors (CCR1, CCR2, CCR3, CXCR4, CCR8 and CXCR6) in these
cells did not confer susceptibility to LukED (Supplementary Fig. 2c). 


CCR5 antagonists block LukED cell killing 


CCR5 is a co-receptor required for HIV infection’”’ and has been
targeted with small molecule antagonists aimed at restricting HIV entry
into host cells'’. We found that one such clinically approved receptor
antagonist, maraviroc, potently blocked LukED killing of CCR5+ cells
(Fig. 1e and Supplementary Fig. 3a) at concentrations similar to those
required to block HIV infection (Supplementary Fig. 3b). Similar inhi- 
bitory effects were observed with the CCR5 antagonists vicriviroc and 
TAK-779, as well as chemokines that are natural ligands of CCR5 
(Supplementary Fig. 3a, c)'*'’. We found that maraviroc resulted in 
complete blockade of LukED pore formation, an essential process for 
cytotoxicity (Fig. 1f and Supplementary Fig. 3d). 

We next investigated whether S. aureus was able to kill CCR5+ cells
in a LukED-dependent manner. The expression of lukED in S. aureus
is inherently low during in vitro growth’. However, deletion of the
transcription factor Rot, a potent repressor, results in the enhanced
expression and production of LukED by S. aureus’. Thus, to assess
S. aureus cytotoxicity towards CCR5+ cells, Jurkat or Jurkat-R5 cells
were infected with S. aureus Δrot (Sa LukED+) and S. aureus
ΔrotΔlukED (Sa LukED−) mutants. Jurkat-R5 cells were killed by
S. aureus in a LukED-dependent manner, whereas Jurkat cells lacking
CCR5 were resistant to killing (Fig. 1g). Additionally, Jurkat-R5 killing
by S. aureus was completely blocked by maraviroc (Fig. 1h).

Figure 1 | LukED requires CCR5 for cell killing. a, Viability of cells exposed
to different leukotoxins (10 µg ml⁻¹). b, Viability of HUT-R5 cells transduced
with control (Ctrl) or CCR5 shRNAs. c, Viability of Jurkat and H9 cells
transduced with CCR5 (-R5) or mouse CD24 (-HSA) followed by treatment
with LukED. d, Viability of GHOST cells overexpressing CCR5 and treated
with indicated leukotoxins. e, Viability of HUT-R5 pre-incubated with
maraviroc and treated with LukED. f, Pore formation, as measured by ethidium
bromide uptake, on Jurkat-R5 with or without maraviroc (MVC; 100 ng ml⁻¹)
followed by incubation with LukED. g, h, Viability of Jurkat or Jurkat-R5 cells
infected with S. aureus (g), in the presence or absence of MVC (h). Means ± s.d.
(n = 3) are shown.


LukE interacts directly with CCR5 


To characterize more precisely the LukED-CCR5 interaction on target
cells, we first determined whether monoclonal antibodies specific
towards extracellular regions of CCR5 (ref. 14) were sufficient to
block toxin activity (Fig. 2a). Antibodies against extracellular loop 2
(ECL-2), but not the amino (N) terminus of the receptor or CXCR4,
significantly blocked toxin killing (Fig. 2a) and prevented association
of functional green fluorescent protein (GFP)-labelled toxin (Supplementary
Fig. 4) with the cell surface of sorted primary human
CD4+CCR5+ T cells (Fig. 2b). Furthermore, toxin association with
the cell surface of CCR5+ cells was also reduced in the presence of
CCL5, and was completely blocked upon addition of maraviroc, similar
to CD4+CCR5− T cells (Fig. 2c). To determine whether LukED
interacts with CCR5, pull-down assays were conducted with purified
toxin and solubilized CCR5. We found that CCR5 interacted with
LukE but not LukD (Fig. 2d). This interaction was significantly
reduced in the presence of maraviroc, natural ligands of CCR5, as
well as monoclonal antibody 45531 directed against ECL-2, but not
3A9 directed against the N terminus of CCR5 (Fig. 2e, f). Additionally,
incubation of LukE (75-fold molar excess) with CCR5+ cells largely
blunted native ligand-induced CCR5 signalling as measured by calcium
mobilization (Supplementary Fig. 5). LukE itself does not seem
to induce CCR5 signalling (Supplementary Fig. 6a, b). Surface plasmon
resonance studies with immobilized native CCR5 (ref. 15) and
purified LukE or LukD subunits confirmed the pull-down studies
and determined that LukE, but not LukD, binds to CCR5 in a
time-dependent and saturable manner, with an apparent dissociation
constant (Kd) of 39.6 ± 0.4 nM (Fig. 2g and Supplementary Fig. 7a, b).
This interaction was specific, as evidenced by an inability of LukE to
bind native CXCR4 (Fig. 2g).

Figure 2 | LukE directly interacts with CCR5. a, Viability of cells treated with
anti-CCR5 monoclonal antibodies (35 µg ml⁻¹) followed by exposure to
LukED (10 µg ml⁻¹). b, Membrane association of GFP-LukED (10 µg ml⁻¹) to
the surface of primary CD4+CCR5+ T cells with or without the indicated
monoclonal antibodies (25 µg ml⁻¹) as determined by fluorescence-activated
cell sorting (FACS). c, Membrane association of GFP-LukED (10 µg ml⁻¹) on
the surface of primary CD4+CCR5+ T cells with or without maraviroc (MVC)
(100 ng ml⁻¹), CCL5 (5 µg ml⁻¹) or on CD4+CCR5− T cells. d-f, Interaction
between His-LukE, LukD, or LukED and HA-CCR5 (d), with or without MVC
(5 µg ml⁻¹) (e), CCL5, CCL4, CCL3 (10 µg ml⁻¹) (f) and monoclonal
antibodies 45531, 3A9, CXCR4 (35 µg ml⁻¹) (f). In f, E stands for the LukE
toxin subunit. Immunoblots are representative of at least two independent
experiments. g, Interaction of LukE with CCR5 and CXCR4 by surface plasmon
resonance. Representative sensorgrams (g) of two experiments performed in
duplicate are shown. Where relevant, means ± s.d. (n = 3) are shown.
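
To give a feel for what an apparent Kd of about 39.6 nM means, the sketch below computes the fraction of receptor occupied at a given free LukE concentration. It assumes simple 1:1 equilibrium binding, which the text does not state explicitly (it reports only saturable binding with an apparent Kd), so the numbers are illustrative rather than a description of the measured sensorgrams. The function name is hypothetical.

```python
def fraction_bound(ligand_nM: float, kd_nM: float = 39.6) -> float:
    """Equilibrium receptor occupancy for 1:1 binding.

    ligand_nM -- free ligand concentration in nM
    kd_nM     -- dissociation constant in nM (default taken from the text)
    """
    if ligand_nM < 0 or kd_nM <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return ligand_nM / (kd_nM + ligand_nM)
```

Under this assumption, half of the receptor is occupied when the free ligand concentration equals Kd, and occupancy approaches but never reaches 1 as the concentration rises, which is what "saturable" binding looks like.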


LukED kills CCR5+ myeloid cells and T cells


We next sought to determine the subsets of primary human lymphoid 
and myeloid cells targeted by LukED. Treatment of blood lymphocytes
with LukED resulted in specific depletion of CCR5+ T cells,
most of which were effector memory T lymphocytes (Fig. 3a and Supplementary
Fig. 8). As with cell lines, the CCR5-dependent killing of
primary cells was completely blocked by maraviroc (Fig. 3a and
Supplementary Fig. 8). A proportion of individuals of Northern European
heritage harbour a 32 base-pair deletion in the CCR5 gene (Δ32
CCR5), resulting in a truncated protein that cannot be surface localized,
thus rendering the CD4+ T cells refractory to HIV infection'''*"”.
Similarly, primary T cells expanded from a Δ32 CCR5
donor were also resistant to LukED cytotoxicity (Fig. 3b). In keeping
with the notion that CCR5 is required for HIV-1 entry into CD4+
T cells’"', selective depletion of CCR5+ T cells by LukED suppressed
HIV-1 spread (Supplementary Fig. 9). 

Memory T cells can be classified into functional subsets on the basis 
of differential chemokine receptor profiles and cytokine production. 
Among T-cell subsets, the CCR6*CCR5* subset produces more 
interleukin (IL)-17 and interferon (IFN)-y than CCR6*CCR5~ 
T cells’*. Consistent with this association, depletion of CCR5*CD4* 
T cells with LukED greatly reduced the proportion of IFN-y- and 
IL-17-producing cells compared with purified CD4* T-cell controls 
(Fig. 3c, day 0). Incubation with the yc-cytokines IL-7 and IL-15 
Figure 3 | LukED kills CCR5⁺ human memory T cells, macrophages and
dendritic cells. a, Total CCR5⁺ primary human T cells (CD3⁺/CCR5⁺)
incubated with media, LukED (2.5 µg ml⁻¹) or maraviroc (MVC; 100 ng ml⁻¹)
followed by LukED treatment. b, Susceptibility of T cells isolated from a
Δ32-CCR5 or WT-CCR5 donor. Cell viability and CCR5 expression evaluated by
flow cytometry as in a. c, Cytokine production of CD4⁺ T cells with or without
LukED treatment (5 µg ml⁻¹) that were stimulated on day 0 with PMA and
ionomycin (P + I; top panel) or cultured in media supplemented with
IL-7/IL-15 (20 ng ml⁻¹) for 7 days followed by stimulation with P + I
(bottom panel). d, Viability of monocyte-derived macrophages and dendritic
cells incubated with LukED (3.0 µg ml⁻¹ with or without MVC). For FACS
plots (a-c), a representative from one of three independent donors is shown.
Bar graphs, mean ± s.d. of results from three independent donors.




significantly enhances the proportion of IL-17⁺ and IL-17⁺/IFN-γ⁺
cells among CCR6⁺ memory T cells. We found that when human CD4⁺ T cells
were first treated with LukED, followed by 7 days of culture with IL-7
and IL-15, there was a substantial reduction in the induction of IFN-γ
and IL-17/IL-22-secreting CCR6⁺ T cells (Fig. 3c). This finding
correlates well with depletion of the CCR6⁺CCR5⁺ memory progenitor
subset (Supplementary Fig. 10). In addition to Th1 and Th17 effector
cells, LukED also killed macrophages and dendritic cells in a
CCR5-dependent manner (Fig. 3d).


LukED targets CCR5⁺ cells in vivo


Next we examined the contribution of CCR5 to S. aureus pathogenesis
and determined the influence of LukED on the targeted killing of
CCR5⁺ cells in vivo. We found that murine CCR5 (mCCR5) renders
transfected 293T cells fully susceptible to the toxin (Supplementary
Fig. 11a, b). Additionally, primary murine macrophages treated with
high concentrations of maraviroc were partly protected from
toxin-mediated killing, confirming that LukED is directly targeting mCCR5
(Supplementary Fig. 11c). Because maraviroc is potent towards human
CCR5 but not mCCR5 (Fig. 1e and Supplementary Fig. 11), we chose
to study wild-type (WT) and CCR5-deficient mice with the hypothesis
that the latter would be resistant to LukED cytotoxicity.
S. aureus-elicited lymphocytes and macrophages from WT mice were highly
susceptible to purified LukED, whereas lymphocytes and macrophages
isolated from CCR5⁻/⁻ mice were markedly resistant (Fig. 4a, b). To
validate further that S. aureus kills CCR5⁺ leukocytes in vivo, we
implemented a peritonitis model in which WT and CCR5⁻/⁻ mice
were infected with S. aureus. CCR5 surface expression was not required
for the initial influx of immune cells to the infection site, as the cells
recovered and their profiles were identical among all mice
(Supplementary Fig. 12). However, lymphocytes and macrophages elicited
in vivo in WT mice were more susceptible to S. aureus killing than
those from the CCR5⁻/⁻ mice (Fig. 4c, d). LukED is associated with
S. aureus pathogenesis in a murine model of systemic infection. Using
this model, CCR5⁻/⁻ mice infected with WT S. aureus exhibited a
significantly lower bacterial burden in the kidneys than infected
WT mice (Fig. 4e), a phenotype similar to that observed
for mice infected with a S. aureus ΔlukED mutant. After 96 h,
infected CCR5⁻/⁻ mice also exhibited significantly reduced serum
pro-inflammatory cytokines and chemokines and showed a commensurate
reduction in innate immune cells in the kidney compared with
WT mice (Fig. 4f, g), signs consistent with infection resolution.
Additionally, when WT mice were challenged systemically with WT or a
ΔlukED mutant, we observed LukED-dependent killing of CCR5⁺
macrophages in infected kidneys, consistent with our hypothesis that
LukED is capable of targeting CCR5⁺ leukocytes during infection
(Fig. 4h). In support of the importance of CCR5 targeting in vivo,
the mortality associated with S. aureus bloodstream infection was
reduced for CCR5-deficient mice, a phenotype similar to that of mice
challenged with strains of S. aureus lacking lukED (Fig. 4i).


Discussion 


To our knowledge, CCR5 is the first described cellular receptor that
is necessary and sufficient for the killing of mammalian cells by a
staphylococcal bi-component leukotoxin. Thus, in addition to HIV,
Toxoplasma gondii and poxviruses (vaccinia and myxoma),
S. aureus can also exploit CCR5 to target immune cells. Interestingly,
the Δ32 allele of CCR5 is thought to have been acquired through
selective pressure imparted by a deadly pathogen. Yersinia pestis
or variola virus were postulated as potential driving forces behind this
selection, but these hypotheses have either been discounted or remain
uncertain in favour of an older selection event incited by an
immune-cell-targeting pathogen. Our findings put forth the possibility that
resistance to S. aureus leukotoxins may have influenced the selection
of the Δ32 allele.
Figure 4 | CCR5⁺ cell killing is important for S. aureus pathogenesis.
a, b, Viability of primary murine peritoneal-elicited immune cells from R5⁺/⁺
(n = 3) or R5⁻/⁻ (n = 3) mice after incubation with LukED (10 µg ml⁻¹).
c, d, In vivo viability of recruited immune cells from R5⁺/⁺ (n = 10) or R5⁻/⁻
(n = 10) mice challenged with live S. aureus Δrot. e, Bacterial colony-forming
units (CFU) recovered from the kidneys of R5⁺/⁺ (n = 8) or R5⁻/⁻ (n = 9)
mice infected for 96 h with WT S. aureus. f, Serum cytokine and chemokine
amounts from animals in e. g, Quantification of neutrophils and macrophages


The finding that LukED selectively kills CCR5⁺ T cells, macrophages
and dendritic cells extends the repertoire of immune cells
targeted by this leukotoxin and supports a role for these leukocytes
in the resolution of S. aureus infection. The lukED gene is believed to
be present in many clinically relevant strains (>70%) including clones
responsible for most infections in the USA and Germany, although it
is absent in a subset of strains causing hospital-acquired infection (for
example EMRSA15/16) in the UK. Most isolates lacking lukED
seem to be of clonal complex 30 (USA200/EMRSA16), which is
known to produce low amounts of cytotoxins. Conceivably, the
pathogenesis of these strains is influenced by the weakened immune
status of hospitalized patients rather than toxic molecules. In contrast,
we predict that virulent clinical strains producing large amounts of
LukED (for example, clonal complex 8) use the toxin to eliminate
antigen-presenting cells as well as S. aureus-specific CCR5⁺
Th1/Th17 cells, which are induced by the bacterium and are protective
against infection. In support of this hypothesis, we demonstrate
that LukED kills CCR5⁺ cells in vivo during systemic infection and
that mice lacking CCR5 are protected from the mortality associated
with acute S. aureus disease. Current systemic murine infection models
are insufficient to evaluate reliably CCR5⁺ T-cell susceptibility to
LukED (data not shown). However, our in vitro data and in vivo
studies with CCR5⁺ macrophages strongly support the notion that
subsets of CCR5⁺ T cells are also targeted in vivo.

Interestingly, LukED-mediated toxicity towards neutrophils and 
monocytes is not blocked by maraviroc (data not shown), suggesting 
LukED targets these cells through alternative and non-redundant 
mechanisms. This point also implies a role for CCR5⁺ myeloid cells
and T cells in resolving acute infection, one that extends beyond the 
recovered from infected kidneys 96 h after infection. h, In vivo viability of
recruited macrophages from R5⁺/⁺ mice challenged with S. aureus WT
(n = 10) or ΔlukED (n = 10). i, Survival of R5⁺/⁺ mice infected with WT
S. aureus (n = 10) or a ΔlukED mutant (n = 10) and R5⁻/⁻ mice infected with WT
S. aureus (n = 20). FACS plots show a representative from one of 10 infected
animals. *P < 0.05; **P = 0.001; ***P = 0.0001 by one-way analysis of
variance (a, b), Student's t-test (c-h) and Mantel-Cox test (i). Bar graphs,
mean ± s.d.


initial control of infection imparted by neutrophils. The finding that 
LukED toxicity towards CCR5⁺ cells is potently neutralized by a
clinically approved CCR5 antagonist (maraviroc) suggests that these 
types of drug could provide much-needed therapeutic alternatives in 
the treatment of S. aureus infections. 


METHODS SUMMARY 


Cell lines and primary human cells were maintained in RPMI plus 10% fetal
bovine serum with penicillin and streptomycin, unless supplemented as
otherwise indicated, and were incubated with LukE, LukD or LukED as previously
described. All blood samples were obtained from anonymous healthy donors as
buffy coats (New York Blood Center). The New York Blood Center obtained
written informed consent from all participants. Animal experiments were
performed in accordance with the Institutional Animal Care and Use Committee at
New York University School of Medicine. CCR5-overexpressing cell lines and
CCR5 shRNA knockdowns were generated by lentiviral-based transduction as
previously described. Isolation of human peripheral blood mononuclear cells
(PBMC) and their sorted subsets was performed as previously described.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Cell culture conditions and viruses. Mammalian cells were maintained at 37 °C
with 5% CO₂ in RPMI supplemented with 10% fetal bovine serum (FBS; Atlanta
Biologicals) and penicillin (100 U ml⁻¹) and streptomycin (0.1 mg ml⁻¹)
(Mediatech) unless stated otherwise. Lentivirus-based overexpression and
knockdown of human CCR5 were conducted according to previously described
transduction methods. Virus stocks were produced by DNA transfection mediated
by calcium phosphate as described. CCR5 overexpressing and shRNA-encoding
viruses, including non-coding shRNA or HSA (mCD24)-overexpressing
controls, were used at a multiplicity of infection of 1-3. HIV-R5 virus used for
infection of primary T cells was used at a multiplicity of infection of 0.3.
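
The multiplicities of infection quoted above translate into a virus volume once a stock titre is known. The following is a minimal sketch of that arithmetic only; the titre and cell number in the example are hypothetical values chosen for illustration, not figures from this study, and the function name is an assumption.

```python
def virus_volume_ul(n_cells: float, moi: float, titre_per_ml: float) -> float:
    """Volume of virus stock (in microlitres) needed to reach a target MOI.

    n_cells: number of target cells in the culture.
    moi: desired multiplicity of infection (infectious units per cell).
    titre_per_ml: infectious units per millilitre of stock (hypothetical input).
    """
    if n_cells < 0 or moi < 0 or titre_per_ml <= 0:
        raise ValueError("n_cells and moi must be >= 0 and titre_per_ml > 0")
    infectious_units = n_cells * moi           # total infectious units required
    return infectious_units / titre_per_ml * 1000.0  # convert ml to microlitres


if __name__ == "__main__":
    # Hypothetical example: one million T cells at the MOI of 0.3 quoted in the
    # text, using an assumed stock titre of 1e8 infectious units per ml.
    print(f"{virus_volume_ul(1e6, 0.3, 1e8):.1f} ul of stock")
```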
Isolation of human PBMC, T-cell purification and activation. Blood was
obtained from de-identified, consenting healthy adult donors as buffy coats
(New York Blood Center) and from Δ32/Δ32 CCR5 donors. Human peripheral
blood mononuclear cells (PBMCs) were isolated from blood using a Ficoll-Paque
PLUS (GE Amersham) gradient. Resting CD4⁺ and CD8⁺ human T cells were
purified as previously described. Briefly, CD4⁺ and CD8⁺ T cells were isolated
from purified PBMCs using Dynal CD4⁺ or CD8⁺ Isolation Kits (Life
Technologies) and were more than 99% pure. To purify naive, central memory
and effector memory subsets, isolated CD4⁺ and CD8⁺ cells were stained with
CCR7 and CD45RO antibodies, and CD45RO⁻CCR7⁺ (TN), CD45RO⁺CCR7⁺
(TCM) and CCR7⁻ (effector memory T lymphocyte) subsets were sorted using a flow
cytometer (FACSAria; BD Biosciences). In some experiments, total CD45RO⁺
(TM) cells were sorted into CCR5⁺ and CCR5⁻ subsets. Sorted subsets were more
than 98% pure. Primary human CD4⁺ T cells for HIV-R5 infections were activated
using anti-CD3/CD28 coated beads (Dynabeads, Invitrogen) and maintained in
RPMI + penicillin and streptomycin + 10% FBS supplemented with 200 U ml⁻¹
IL-2 and 2 mM L-glutamine (Mediatech). In some experiments, CD4⁺ T cells were
cultured in 20 ng ml⁻¹ IL-7 plus IL-15 (R&D Systems) for 7 days. All experiments
with primary PBMCs from WT CCR5 donors were performed with cells from
at least three independent donors. Experiments using Δ32 CCR5 PBMCs were
performed with cells from two donors.

Generation of primary human monocyte-derived macrophages and dendritic
cells. Monocyte-derived macrophages and dendritic cells from healthy donors
were generated from CD14⁺ cells as previously described. Monocyte (CD14⁺)
cells were isolated from PBMCs by anti-CD14 antibody-coated bead-based
sorting using AutoMACS (Miltenyi Biotec) and were typically more than 99%
pure. Monocyte-derived macrophages were generated from CD14⁺ cells by
supplementing the culture medium with human granulocyte macrophage
colony-stimulating factor (50 ng ml⁻¹). Monocyte-derived dendritic cells were
generated from CD14⁺ cells by supplementing the culture medium with human
granulocyte macrophage colony-stimulating factor (50 ng ml⁻¹) + IL-4
(40 ng ml⁻¹). Cells were cultured for 5 days in the differentiation condition,
followed by addition of LukED as already described.

CCR5 ligands and inhibitors. Maraviroc and TAK-779 were obtained through
the AIDS Research and Reference Reagent Program, Division of AIDS, NIAID,
NIH. Vicriviroc was purchased from Selleck Chemicals. Recombinant human
RANTES (CCL-5) and macrophage inflammatory protein-1β (MIP-1β, CCL-4)
were obtained from R&D Systems. Macrophage inflammatory protein-1α
(MIP-1α, CCL-3) was obtained from Biolegend. Maraviroc was used at
100 ng ml⁻¹ unless otherwise indicated.

FACS analysis. Cells were stained as previously described. For intracellular
staining, CD4* T-cell cultures were stimulated for 5h at 37 °C with PMA, iono- 
mycin and Golgistop (BD Biosciences). Stimulated cells were washed with PBS 
and stained with Fixable Viability Dye to gate on live cells. Cells were then 
fixed and permeabilized by a commercially available intracellular staining kit 
(eBioscience) according to the manufacturer’s protocol. All FACS data were 
acquired on an LSRII flow cytometer (BD Biosciences) using FACSDiva software. 
Data were analysed using Flowjo software (Treestar). 

Antibodies and dyes. Antibodies used for surface and intracellular staining of 
primary human cells included the following: CD3-Percp Cy5.5 (clone UCHT1), 
CD4-Alexa700 (clone OKT4), CD8-Pacific Blue (clone RPA-T8), CXCR3-Percp 
Cy5.5 (clone G025H7), IL-17-Alexa488 (clone BL168), IFN-γ-Alexa700 (clone
4S.B3) (Biolegend), CD45RO-PeCy7 (clone UCHL1), CCR6-biotin (clone 11A9),
CCR4-PE (clone 1G1), CCR5-PE (clone 2D7) or CCR5-APC-Cy7 (clone 2D7),
streptavidin-APC, HSA-PE (clone M1/69) (BD Biosciences), CCR7-FITC
(clone 150503) (R&D systems) and IL-22-PerCP-eFluor710 (clone 22URTI)
(eBioscience).

Antibodies used for surface staining of primary murine cells included the
following: CD3e-APC (clone 145-2C11), CD11b-PeCy7 (clone M1/70),
CD11b-FITC (clone M1/70), Ly6G-FITC (clone 1A8), Ly6G-PE (clone 1A8),
CCR5-biotin (C34-3448), CD16/CD32 Fc Block (clone 2.4G2) (BD Biosciences),
F4/80-APC (clone BM8), F4/80-PeCy7 (clone BM8), streptavidin-PerCP.Cy5.5, and
B220-A700 (clone RA3-6B2) (Biolegend). Fixable viability dyes eFluor-450 and
eFluor-780 were obtained from eBioscience.

Antibodies used for LukE-CCR5 interaction mapping included the following: 
CCR5 clones 45533, 45529, 45531, 45517, 45523, 45549, 3A9, 45502 and 45519
(ref. 14) (R&D systems). The control CXCR4 antibody used in these studies was 
clone 44716 (R&D systems). 

Leukotoxin treatments. Jurkat, H9, Hut-R5 and GHOST cell lines, primary 
human PBMCs and their sorted subsets, as well as primary murine peritoneal- 
elicited cells, were incubated with LukE, LukD or LukED as previously described.
In all experiments cells were seeded into a 96-well plate (1 × 10° to 2 × 10° cells
per well), treated for 1 h at 37 °C and evaluated for morphological changes and
ethidium bromide (EtBr) uptake by microscopy, or viability using CellTiter 
(Promega), CytotoxOne (Promega), cell scatter by FACS and staining with com- 
mercial viability dyes (eBioscience). CellTiter, CytotoxOne and EtBr measure- 
ments were made using an EnVision 2103 Plate Reader (Perkin-Elmer). 
Intoxications were done in the presence of specific inhibitors (maraviroc, 
TAK-779 and vicriviroc), chemokines (CCL3, CCL4, CCL5) or monoclonal 
CCR5 and CXCR4 antibodies where indicated in the text. 

S. aureus in vitro infection experiments. S. aureus (Newman) Δrot and Δrot
ΔlukED were subcultured for 5 h in tryptic soy broth followed by washing in
RPMI plus 10% FBS and normalization to 1 × 10° CFU per millilitre in this same
media. Normalized bacteria were then added to 2 × 10° Jurkat and Jurkat-R5 cells
(multiplicity of infection 10:1) that had been pre-stained with α-CCR5-PE
antibody (clone 2D7) and mixed at a ratio of 50:50. Staining of CCR5 with
α-CCR5-PE antibody (clone 2D7) was previously determined to be stable for longer
than 6 h on the surface of Jurkat-R5 cells yet did not influence the killing of these
cells by LukED (data not shown and Supplementary Fig. 13). Infected cells were
incubated at 37 °C + 5% CO₂ for 4 h followed by the addition of lysostaphin to kill
all bacteria. Samples were then analysed on a BD LSRII flow cytometer. Depletion
of CCR5⁺ compared with CCR5⁻ cells was evaluated and shown graphically as
the percentage of dead cells relative to controls with no toxin. For studies with
maraviroc, the inhibitor was added to cells 30 min before the addition of bacteria
as described above. Experiments were conducted three times in triplicate.
Generation of GFP fusion proteins. To generate recombinant N-terminal
His₆-GFP-tagged LukE and LukD, the mature protein coding sequences of
LukE and LukD from S. aureus Newman genomic DNA were PCR-amplified
using the following primers: lukE-F-SalI
(5'-CCCC-GTCGAC-AATACTAATATTGAAAAT-3'), lukD-F-SalI
(5'-CCCC-GTCGAC-GCTCAACATATCACA-3'), lukE-R-NotI
(5'-CCCC-GCGGCCGC-tta-ATTATGTCCTTTCACTTTAATTTCGTG-3') and lukD-R-NotI
(5'-CCCC-GCGGCCGC-tta-TACTCCAGGATTAGTTTCTTTAGAATC-3'). Amplified
sequences were subcloned into pET-41b (Novagen), resulting in a fusion of
His₆-GFP with the N terminus of mature LukE or LukD. Recombinant plasmids
were transformed into Escherichia coli DH5α and transformants selected by
kanamycin resistance. Positive clones were transformed into E. coli
LysY/LacQ (New England BioLabs) for protein expression and purification.

Leukotoxin purification. LukE, LukD, GFP-LukE, GFP-LukD, LukS, LukF,
LukA and LukB were purified from E. coli LysY/LacQ as previously described
followed by endotoxin removal with Detoxi-Gel Endotoxin Removal Gel
(Thermo Scientific). The following alterations were made for purification of
recombinant GFP-LukE and GFP-LukD. Upon sonication of bacterial cell
pellets, lysates were incubated with 1% Triton X-100 for 1 h at room temperature.
After incubation, lysates were centrifuged for 60 min at 12,350g and passed
through a 0.22 µm filter before completing the purification protocol as described.
LukED membrane association studies. Association of LukED with the surface
of CCR5⁺ cells was measured as follows. A toxic dose of purified recombinant
GFP-LukE or GFP-LukD with LukD or LukE, respectively (final concentration
10 µg ml⁻¹), was incubated for 30 min on ice with sorted CD4⁺CCR5⁺ or
CD4⁺CCR5⁻ T cells (5 × 10* cells per well) from three independent donors.
Cells were gated as GFP positive compared with baseline fluorescence of
untreated cells. A total of 50,000 events were collected in all conditions tested.
Owing to the high amount of background fluorescence of GFP toxins with the
membranes of transduced cell lines, we were unable to use these cells for
membrane association assays (data not shown). As an alternative, we used primary
CD4⁺ T cells for membrane association studies. To increase the abundance of
CCR5 on these cells and foster reproducible measures of membrane association,
CD4⁺CCR5⁺ cells were generated from CD4⁺ cells infected with a lentivirus
encoding CCR5 and sorted by FACS as CCR5⁺ from the resulting CD4⁺
population after surface staining for CCR5 using 2D7 clone (PE). CD4⁺CCR5⁻ cells
were sorted from the same population as those cells with undetectable CCR5
surface expression. CCR5 surface staining with 2D7 antibody does not influence
toxin killing kinetics and therefore is unlikely to adversely influence membrane
association, as the latter is required for the former (Supplementary Fig. 13).




Paradoxically, clone 2D7 also binds to ECL-2 of CCR5, as does clone
45531, which blocks toxin activity. However, 2D7 and 45531 bind to distinct
portions of ECL-2 (the N-terminal portion and carboxy (C)-terminal portion,
respectively), perhaps explaining this phenomenon. Alternatively, our staining
protocols may not have sufficiently saturated all receptor sites, thereby allowing
functional characterization of toxin in the presence of 2D7.

Experiments assessing maraviroc, natural ligand or antibody inhibition of
LukED membrane association were conducted in a similar fashion. However,
in these instances cells were first pre-incubated for 30 min with maraviroc
(100 ng ml⁻¹), CCL5 (5 µg ml⁻¹), 3A9, 45531 or CXCR4 monoclonal antibodies
(25 µg ml⁻¹) or buffer before addition of a lethal concentration of
LukE-GFP + LukD to the cells (5-10 µg ml⁻¹). After treatment, cells were washed,
re-suspended in fixation buffer (FACS buffer + 2% paraformaldehyde) for
15 min at room temperature, washed again, re-suspended in FACS buffer, and
the fluorescence of bound toxin was monitored by flow cytometry. Cells are
shown as the percentage that were GFP positive.

Surface plasmon resonance analysis of LukE and LukD binding to solubilized
CCR5 and CXCR4. Binding kinetics of LukE and LukD to CCR5 and CXCR4 by
surface plasmon resonance were measured as previously described. This
approach has also been used to detect ligand interactions with CXCR1 and
CXCR2 (refs 43, 44). A C9-tagged CCR5 was solubilized using 50 mM HEPES,
pH 7.0, 150 mM NaCl, 0.1% DDM, 0.1% CHAPS, 0.02% CHS. This solubilization
scheme is known to retain conformationally specific antibody binding to
both CCR5 and CXCR4 (ref. 15). Approximately 700 relative units (RU) of the
CCR5 receptor was captured onto a 1D4 antibody-bound CM5 chip. Cells
expressing a C9-tagged CXCR4 receptor were also solubilized as a control surface
in the same buffer. C9-CXCR4 was captured to approximately 1,200 RU. LukE
or LukD was diluted to 1.7 µM in running buffer containing 50 mM HEPES,
pH 7.0, 150 mM NaCl, 0.02% CHS, 0.1% DDM and 0.1% CHAPS and tested for
binding in a threefold dilution series at a flow rate of 50 µl min⁻¹. Each
concentration series was replicated twice as shown by the overlaid sensorgrams.
All data were collected at 25 °C and conducted at least twice in duplicate.
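
The threefold dilution series described above is straightforward to enumerate. The sketch below assumes a top concentration of 1.7 µM, as stated in the text; the number of points in the series is not given in this section, so the default of five is a hypothetical choice, as is the function name.

```python
def dilution_series(top_uM: float = 1.7, fold: float = 3.0, n_points: int = 5) -> list[float]:
    """Concentrations (in µM) of a serial dilution, highest first.

    top_uM: starting analyte concentration (1.7 µM in the text).
    fold: dilution factor between consecutive points (threefold in the text).
    n_points: number of concentrations; not stated in the text, assumed here.
    """
    if top_uM <= 0 or fold <= 1 or n_points < 1:
        raise ValueError("need top_uM > 0, fold > 1 and n_points >= 1")
    return [top_uM / fold**i for i in range(n_points)]


if __name__ == "__main__":
    for conc in dilution_series():
        # Print in nM so the values can be compared with the reported Kd.
        print(f"{conc * 1000:8.1f} nM")
```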

Biochemical studies to detect interactions between LukED and CCR5. 293T
cells were transfected with a vector containing HA-tagged CCR5 (Missouri S&T
cDNA Resource Center; www.cdna.org), followed by solubilization
(approximately 2.0 × 10’ cells per condition) in PBS + 1% Brij O10 + Complete
EDTA-free protease inhibitor cocktail (Roche). Solubilized CCR5 was then added
to 25 µl of nickel resin containing no toxin or bound LukE, LukD or LukED. For the
maraviroc, natural ligand and antibody inhibition experiments, the solubilized
CCR5 was pre-incubated for 30 min at room temperature with 5 µg ml⁻¹ of
maraviroc, 10 µg ml⁻¹ of each chemokine or 35 µg ml⁻¹ of each antibody
followed by incubation with nickel resin containing LukE. After incubation
with cell lysates, the resin/protein complexes were fixed with 2 mM DTSSP
(Pierce) for 30 min, quenched with 20 mM Tris pH 8.0 for 15 min, washed four
times in PBS + 1% Brij O10 and boiled in 4X SDS boiling buffer. All samples were
run on a 10% SDS-PAGE gel at 80 V, followed by transfer to nitrocellulose at 1 A
for 1 h. Membranes were blocked in PBS + 0.01% Tween + 5% milk for 1 h and
incubated overnight with either α-HA antibody for CCR5 (Covance) or α-His
antibody (Cell Sciences) for LukE and LukD. The following day, secondary goat
α-mouse-HRP antibody (Bio-Rad) was added to the membranes for 1 h followed
by the addition of SuperSignal West Femto Maximum Sensitivity Substrate
(Thermo Fisher Scientific) for detection.

Measurement of CCR5 activation by calcium mobilization. CCR5 activation by
calcium mobilization in cell lines and primary cells was assessed using the
commercial dye Fluo4-AM (Invitrogen). Cells were labelled for 30 min at room
temperature with 3 µM Fluo4 in Hanks' balanced salt solution, followed by three
washes in Hanks' balanced salt solution and incubation at 37 °C for 30 min. Cells
were analysed on a flow cytometer over time and, at 100, ligand (CCL3, CCL4,
CCL5, 10 ng ml⁻¹) or LukE (10-20 µg ml⁻¹) was added to the cells. Fluorescence
was monitored thereafter by flow cytometry (500 events were collected per
second) until the indicated completion of each experiment. For conditions in
which inhibition of receptor activation was monitored, cells were pre-incubated
with either maraviroc (1 µg ml⁻¹) or LukE (10-20 µg ml⁻¹) during the 30 min
incubation at 37 °C described above. Graphs show the mean fluorescence of all
events collected in 5 s intervals.
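
The 5 s averaging used for the calcium-flux graphs can be sketched as a simple binning step. This is an illustrative sketch only, not the authors' analysis pipeline (the text names the acquisition software but not the binning code): it assumes each event is exported as a (time in seconds, fluorescence) pair, and the function name is hypothetical.

```python
from collections import defaultdict


def binned_mean_fluorescence(events, bin_s: float = 5.0):
    """Average fluorescence of flow-cytometry events in fixed time bins.

    events: iterable of (time_s, fluorescence) pairs (assumed export format).
    bin_s: bin width in seconds (5 s in the text).
    Returns a list of (bin_start_s, mean_fluorescence), sorted by time.
    Bins that contain no events are omitted.
    """
    if bin_s <= 0:
        raise ValueError("bin_s must be > 0")
    sums = defaultdict(float)
    counts = defaultdict(int)
    for time_s, fluorescence in events:
        index = int(time_s // bin_s)  # which bin this event falls into
        sums[index] += fluorescence
        counts[index] += 1
    return [(index * bin_s, sums[index] / counts[index]) for index in sorted(sums)]


if __name__ == "__main__":
    # Hypothetical events: two in the first bin, one in each of the next two.
    demo = [(0.0, 1.0), (4.9, 3.0), (5.0, 10.0), (12.0, 4.0)]
    for start, mean in binned_mean_fluorescence(demo):
        print(f"{start:5.1f} s  mean fluorescence {mean:.1f}")
```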

Murine in vitro and in vivo experiments. In vitro assessment of
peritoneal-elicited immune cell killing by LukED was conducted as follows. Female
age-matched (4-6 weeks) C57BL/6 WT or CCR5⁻/⁻ mice (Taconic) were injected
with 1 × 10’ CFU of heat-killed S. aureus Newman ΔlukED intraperitoneally.
Twenty-four hours later, mice were injected with an additional 1 × 10’ CFU of
the same strain. After another 24 h, mice were killed and peritoneal-elicited
immune cells were lavaged with 7 ml of PBS followed by lysis of red blood cells
in ACK lysing buffer and re-suspension in RPMI + 10% FBS. LukED was then
added to cells as described above and incubated for 1 h at 37 °C with 5% CO₂.
After incubation, cells were washed in PBS and stained with viability dye followed
by surface staining for B220, CD11b, F4/80, Ly6G, CD3 and CCR5. The
percentages of dead cells shown are an average of cells isolated and intoxicated from
three independent mice. Means and standard deviations are shown.
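
Summaries of this kind (mean and standard deviation across three animals) can be reproduced with the standard library. The sketch below assumes the sample standard deviation (n − 1 denominator), which the text does not specify, and the input percentages are invented for illustration; the function name is hypothetical.

```python
from statistics import mean, stdev


def mean_sd(values):
    """Return (mean, sample standard deviation) for replicate measurements.

    Uses the n - 1 denominator; whether the source used sample or population
    s.d. is not stated, so this choice is an assumption.
    """
    values = list(values)
    if len(values) < 2:
        raise ValueError("need at least two replicates to compute an s.d.")
    return mean(values), stdev(values)


if __name__ == "__main__":
    # Hypothetical percentages of dead cells from three independent mice.
    average, spread = mean_sd([10.0, 20.0, 30.0])
    print(f"mean {average:.1f}%, s.d. {spread:.1f}%")
```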

For experiments designed to measure S. aureus killing of CCR5⁺ cells in vivo,
female age-matched (4-6 weeks) C57BL/6 WT or CCR5⁻/⁻ mice (Taconic) were
injected on day 1 with 1 × 10” heat-killed S. aureus to promote the recruitment of
CCR5⁺ macrophages and lymphocytes to the peritoneum. On day 2, mice were
challenged with live S. aureus Δrot followed by the isolation of peritoneal immune
cells 16-20 h later. Isolated cells were processed for FACS as described above and
the viability of lymphocytes and macrophages was evaluated. The percentages of
dead lymphocytes were averaged from 10 WT and 10 CCR5⁻/⁻ animals;
representative FACS plots are shown.

For murine systemic infections, female age-matched (4-6 weeks) C57BL/6 WT
or CCR5⁻/⁻ mice (Taconic) were infected with WT S. aureus Newman as
previously described. After 96 h, serum was collected and kidneys removed,
homogenized, processed for FACS and plated as previously described. All survival
curves were conducted as previously described using WT S. aureus Newman
and an isogenic ΔlukED mutant. For flow cytometry of immune cells from
WT or ΔlukED infected kidneys, organs were removed after 96 h and
mechanically homogenized. Immune cells in homogenized tissues were enriched by
performing a 40/80 Percoll (GE Healthcare) density gradient centrifugation.
Cells were subsequently processed for surface and viability staining
(see above).
Structure of a presenilin family
intramembrane aspartate protease


Xiaochun Li, Shangyu Dang, Chuangye Yan, Xinqi Gong, Jiawei Wang & Yigong Shi


Presenilin and signal peptide peptidase (SPP) are intramembrane aspartyl proteases that regulate important biological 
functions in eukaryotes. Mechanistic understanding of presenilin and SPP has been hampered by lack of relevant structural 
information. Here we report the crystal structure of a presenilin/SPP homologue (PSH) from the archaeon Methanoculleus
marisnigri JR1. The protease, comprising nine transmembrane segments (TMs), adopts a previously unreported protein 
fold. The amino-terminal domain, consisting of TM1-6, forms a horseshoe-shaped structure, surrounding TM7-9 of the 
carboxy-terminal domain. The two catalytic aspartate residues are located on the cytoplasmic side of TM6 and TM7, 
spatially close to each other and approximately 8 Å into the lipid membrane surface. Water molecules gain constant
access to the catalytic aspartates through a large cavity between the amino- and carboxy-terminal domains. Structural 
analysis reveals insights into the presenilin/SPP family of intramembrane proteases. 


Signalling through regulated intramembrane proteolysis (RIP) is
ubiquitously conserved from bacteria to humans. The central step of RIP
signalling is cleavage of a target protein within the lipid bilayer by a
membrane-embedded protease. These intramembrane proteases
fall into three distinct families: serine protease rhomboid,
metalloprotease site-2 protease (S2P), and aspartyl proteases exemplified by
presenilin and SPP.

Presenilin is the catalytic component of γ-secretase, which is
responsible for the generation of amyloid-β peptide from amyloid
precursor protein (APP). An altered cleavage pattern of APP,
involving aberrant accumulation of a 42-residue amyloid-β peptide (Aβ42)
over a 40-residue product (Aβ40), underlies the development of
Alzheimer's disease. Besides APP, γ-secretase has a number of other
important substrates such as Notch. Accelerated cleavage of Notch
by γ-secretase leads to excessive signalling, contributing to certain
cancers. Suppressing the generation of Aβ42 without affecting
cleavage of Notch represents an attractive strategy for therapeutic
intervention in Alzheimer's disease. SPP is required for degradation of the
signal peptide in membrane-targeted proteins. Presenilin and SPP
share extensive sequence homology, both with the signature motifs 
ΦΦYDΦΦ (in which Φ denotes a hydrophobic residue) on TM6 and
ΦGΦGD on TM7 (ref. 10).

In addition to presenilin, the γ-secretase complex contains three
other essential components, PEN-2, APH-1 and nicastrin. In this
complex, PEN-2 is thought to interact directly with presenilin,
which comprises two auto-proteolysed fragments known as NTF
(TM1-6) and CTF (TM7-9). As the central component, presenilin
also associates with APH-1 and nicastrin. The central role of
presenilin in the γ-secretase complex is manifested by identification
of over 150 disease-derived mutations in presenilin.

Recent structural investigations of the prokaryotic homologues of
rhomboid and S2P have facilitated mechanistic understanding
of these membrane-embedded proteases. By contrast, detailed
structural information on presenilin and SPP has been extremely slow to
emerge. The limited structural information includes electron
microscopic analysis of the γ-secretase complex, an NMR analysis of
presenilin CTF, and a crystal structure of a GXGD peptidase FlaK
which shows no sequence homology to presenilin/SPP.


Engineering and crystallization of PSH 


We cloned human presenilin 1 (also known as PSEN1, hereafter referred 
to as PS1) and its homologues in several organisms. Extensive effort 
aimed at generation of functional recombinant presenilin was only 
successful at the level of protease activity detection; such recombinant 
proteins were unfit for biophysical characterization. After years of effort 
on the eukaryotic presenilin, we sought its archaeal homologues. Of the
13 archaeal presenilin homologues, only two were successfully
overexpressed and purified. We focused on PSH (also known as MCMJR1),
which gives a higher yield and shares extensive sequence homology with 
PS1. Unfortunately, the full-length PSH (residues 1-301) exhibits poor 
behaviour in solution in all non-ionic detergents tested and defied all 
attempts at crystallization. 

We designed a sequence-based protein engineering strategy to 
improve behaviour of PSH in solution. Sequence alignment between 
PSH and presenilin homologues from other archaeal species revealed 
a number of positions that contain highly conserved amino acids in all 
other archaeal homologues, except PSH. We reasoned that replace- 
ment of the non-conserved amino acids in PSH by the conserved ones 
may improve its behaviour in solution, but is unlikely to affect its 
folding or protease activity. On the basis of this rationale, we gene- 
rated 20 PSH variants, each containing one or two point mutations, 
and individually examined their protease activity using the trans- 
membrane segment of Gurken as the substrate** (Supplementary 
Fig. 1). Only the protease-active PSH variants were selected for evalua- 
tion of behaviour in solution and possibly crystallization trials. Single 
mutations were combined to generate multiple mutations in the same 
PSH variant (Supplementary Fig. 1). Finally, to increase the likelihood 
of crystallization, we used V8 protease to remove the flexible surface 
loops, which resulted in the removal of residues 182-209. 
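The selection of candidate positions described above can be illustrated with a minimal Python sketch. It assumes that the sequences are already aligned and of equal length, and it flags positions at which the homologues agree with one another but the target differs. The function name `consensus_mismatches`, the `min_fraction` threshold and the toy sequences are hypothetical and are not taken from the PSH alignment.

```python
from collections import Counter


def consensus_mismatches(target, homologues, min_fraction=1.0):
    """Return (position, target_residue, consensus_residue) tuples.

    A position is reported when at least `min_fraction` of the homologues
    share one residue and the target carries a different residue there.
    Positions are 1-based; columns containing gaps ('-') are skipped.
    """
    candidates = []
    for index, target_residue in enumerate(target):
        column = [sequence[index] for sequence in homologues]
        if target_residue == "-" or "-" in column:
            continue
        residue, count = Counter(column).most_common(1)[0]
        if count / len(column) >= min_fraction and residue != target_residue:
            candidates.append((index + 1, target_residue, residue))
    return candidates


# Hypothetical, pre-aligned toy sequences (not real PSH data).
toy_target = "MKDLEAV"
toy_homologues = ["MKNLSAV", "MKNLSAI", "MKNLSAV"]

for position, old, new in consensus_mismatches(toy_target, toy_homologues):
    print(f"{old}{position}{new}")
```

Each reported substitution would still need to be tested experimentally for protease activity and solution behaviour, as was done for the PSH variants; the sketch covers only the sequence comparison.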

Using this approach, we generated a total of 105 PSH variants 
for detailed biochemical evaluation. The PSH variant used for final 
crystallization contains two chains (residues 1-181 and 210-301) and 
a total of five missense mutations, D40N, E42S, A147E, V148P and 
A229V, which mostly affect residues in the surface loops. These 
sequence modifications have no effect on the protease activity of PSH 
(Fig. 1a, lanes 1-3). By contrast, mutation of the two catalytic residues 
Asp 162 and Asp 220 in PSH abolished the protease activity (lanes 4 
and 5). Similar to the wild-type PSH, the protease activity of this variant 
was efficiently inhibited by a known presenilin inhibitor III-31-C 
(ref. 29; Fig. 1a, lanes 6-14). Intriguingly, the elution volume of the 
PSH variant corresponds to a molecular mass in excess of 100 kilo- 
daltons (kDa), indicating oligomerization (Supplementary Fig. 2a). 
After going through about 160,000 conditions, we eventually generated 
suitable crystals of this PSH variant for X-ray data collection (Sup- 
plementary Fig. 2b, c). 


Structure of PSH 


The PSH variant was crystallized in three space groups. The structure
in the C222 space group was determined by platinum-based multi-wavelength
anomalous dispersion (MAD) and refined at 3.3 Å resolution
(Supplementary Tables 1-3 and Supplementary Figs 3-6). The
atomic model was confirmed by selenium anomalous peaks (Supplementary
Figs 5 and 6). The atomic coordinates of PSH were used
to determine the structures in space groups C222₁ and P2 (Supplementary
Table 1). The PSH protein forms a tetrameric complex in
the C222 crystals (Supplementary Fig. 4). Each asymmetric unit contains 
one tetrameric PSH complex in C222, and two complexes in P2. 

Each PSH molecule comprises nine transmembrane α-helices
(Fig. 1b and Supplementary Fig. 7), and is approximately 50 Å in
length, 40 Å in width and 40 Å in height. The surface loops connecting
TM1-TM2 and TM7-TM8 are disordered in the crystals. The amino-terminal
domain (NTD), comprising TM1-6, forms a horseshoe-shaped
structure, which partially surrounds TM7-9 of the carboxy-terminal
domain (CTD; Fig. 1b). Consequently, TM7 of the CTD is
completely surrounded by other TMs. None of the nine TMs is per- 
pendicular to the lipid membrane. TM1-6 are tilted at angles of 15-35 
degrees away from the surface normal of lipid membrane, and TM7-9 
are tilted to a smaller extent. 
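The tilt of a transmembrane helix relative to the membrane normal can be estimated from two points on the helix axis. The following Python sketch assumes that the membrane normal lies along the z axis and that the helix axis is approximated by the vector between its two end points; the coordinates and the function name `tilt_angle_deg` are hypothetical and do not come from the PSH model.

```python
import math


def tilt_angle_deg(start, end, normal=(0.0, 0.0, 1.0)):
    """Angle in degrees between a helix axis and the membrane normal.

    The axis is taken as the vector from `start` to `end`. The absolute
    value of the cosine is used so that the direction in which the helix
    crosses the membrane does not matter.
    """
    axis = [e - s for s, e in zip(start, end)]
    dot = sum(a * n for a, n in zip(axis, normal))
    axis_length = math.sqrt(sum(a * a for a in axis))
    normal_length = math.sqrt(sum(n * n for n in normal))
    cosine = abs(dot) / (axis_length * normal_length)
    return math.degrees(math.acos(min(1.0, cosine)))


# Hypothetical helix end points in angstroms.
print(round(tilt_angle_deg((0.0, 0.0, 0.0), (10.0, 0.0, 30.0)), 1))
```

A helix running exactly along the normal gives 0 degrees, so the 15-35 degree range quoted above corresponds to a clearly inclined axis in this convention.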

Notably, a large hole traverses the entire protein and is
surrounded by TM2, TM3, TM5 and TM7 (Fig. 1b). This hole, formed
mainly by hydrophobic residues, is large enough to allow passage of
small ions; it is likely plugged by lipid molecules and/or other
interacting proteins in cells. The in vitro protease activity of PSH is
strongly influenced by the choice of phospholipids (data not shown).
A cavity from the cytoplasmic side reaches the active site aspartates,
allowing unrestricted solvent access. The catalytic residues Asp 162
and Asp 220 in PSH are located at the bottom of the cavity, approximately
8 Å below the lipid membrane surface from the cytoplasmic
side (Fig. 1b). Owing to the hole and the cavity, some transmembrane
regions of TM2/TM3/TM5/TM6/TM7 are exposed to solvent. This
structural finding is consistent with the biochemical observation that
a number of membrane-embedded residues in TM6 and TM7 of PS1,
including the catalytic residues Asp 257 and Asp 385, are accessible to
crosslinking reagents**’’. Similar cavities have also been observed in
the electron microscopic structure of the γ-secretase complex’.

The topology of PSH is quite different from that of the other two 
families of intramembrane proteases, rhomboid'”” and S2P” (Fig. 2a,
b). An exhaustive search of the Protein Data Bank using DALI”? revealed



Figure 1 | Overall structure of a presenilin/SPP homologue from the
archaeon Methanoculleus marisnigri JR1 (PSH). a, The PSH variant used for
crystallization (denoted ‘Xtal’) is catalytically active. As reported”*, the
membrane protein Gurken was used as a substrate protein for PSH. The
proteolytic activity of the PSH variant (Xtal) is inhibited by III-31-C, an
inhibitor known to be specific for aspartyl proteases such as presenilin”,
to a similar extent as that of the wild-type PSH (right panel). The
concentrations of III-31-C shown are in
micromolar. b, Overall structure of PSH. The nine TMs are divided into the
NTD (TM1-6) and CTD (TM7-9). The putative catalytic residues Asp 162 and 
Asp 220 are located on TM6 and TM7, respectively. The structure is rainbow- 
coloured, with N terminus in blue and C terminus in red. The top and bottom 
panels represent two perpendicular views. The structure is shown in cartoon 
and electrostatic surface representation in the left and right panels, respectively. 
Figures 1b, 3 and 4a, c were prepared using PyMol”. 

Figure 2 | Topology diagrams of PSH and other families of intramembrane
proteases. a, Membrane topology diagram of PSH. The two putative catalytic
residues, Asp 162 and Asp 220, are indicated. b, Membrane topology diagram
of the rhomboid protease GlpG from E. coli and the metalloprotease S2P from
the archaeon M. jannaschii. The catalytic dyad residues Ser-His of GlpG and 
the three zinc-binding residues of S2P are indicated. 


no entry that is similar to the structure of PSH over its entire nine TMs. 
In fact, no entry shows structural similarity to PSH over four TMs. The 
closest entry (with a similarity Z-score of 4.0) is the structure of Sec15 
C-terminal domain’, which had only 61 Cα atoms aligned to PSH with
a root-mean-squared deviation (r.m.s.d.) of 3.1 Å. This analysis indicates
that PSH may represent a previously unreported protein fold,
which we would like to name the presenilin fold (Fig. 2a). 


Figure 3 | Conformation of the active site. a, A close-up comparison of the
active site conformation between PSH and pepsin. The two aspartate residues
in PSH and pepsin are separated by distances of 6.7 Å and 3.1 Å, respectively.

Tetrameric assembly of PSH


The PSH tetramer contains four hydrophobic holes, each traversing 
through the entire TM region (Supplementary Fig. 8a). Each PSH 
molecule contacts two neighbouring molecules exclusively through 
their membrane-spanning regions. Hydrophobic amino acids in TM2 
and TM6 of one PSH molecule make van der Waals contacts to 
residues of the same type in TM3 and TM4 of an adjacent PSH 
molecule. Notably, four aromatic residues on TM3 (Phe 72, Phe 76,
Phe 79 and Phe 82) are placed on the same side of the α-helix and
interact with four hydrophobic amino acids, Leu 48, Ile 51, Leu 55 and
Leu 59, all on TM2 of the neighbouring molecule (Supplementary Fig.
8b). An identical homotetrameric complex of PSH is formed in crystals
of the space groups C222₁ and P2 (Supplementary Fig. 9). As
anticipated, the different PSH molecules within the same space group
or between different space groups are all nearly identical to one another,
with a pair-wise r.m.s.d. of 0.3-0.6 Å over the aligned Cα atoms.
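For reference, the r.m.s.d. over aligned Cα atoms is the square root of the mean squared distance between matched atoms. The short Python sketch below assumes that the two coordinate sets are already superposed and listed in the same atom order; it does not perform the superposition itself, and the coordinates shown are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-squared deviation between two matched coordinate lists.

    Both lists must contain the same number of (x, y, z) tuples, given in
    the same atom order and already superposed in a common frame.
    """
    if not coords_a or len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(math.dist(a, b) ** 2 for a, b in zip(coords_a, coords_b))
    return math.sqrt(squared / len(coords_a))


# Hypothetical Cα positions in angstroms for two superposed molecules.
molecule_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
molecule_b = [(0.0, 0.3, 0.0), (3.8, 0.0, 0.4), (7.6, 0.0, 0.0)]
print(round(rmsd(molecule_a, molecule_b), 2))
```

In practice the optimal superposition would be computed first, for example with the Kabsch algorithm, before this quantity is evaluated.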


Active site and accessibility 

Similar to other aspartyl proteases such as pepsin, PSH contains
two catalytic residues, Asp 162 and Asp 220, located in the intramembrane
aspartyl protease signature motifs VYD₁₆₂AI on TM6 and
MGMGD₂₂₀ on TM7 (Fig. 2a and Supplementary Fig. 7). The reaction
mechanism requires the two aspartate side chains to
be within hydrogen-bond distance; in pepsin*, this distance is 3.1 Å.
In the structure of PSH, however, Asp 162 and Asp 220 face each other
but are separated by a distance of 6.7 Å (Fig. 3a). This observation
suggests that substrate binding may trigger a conformational change 
in PSH that moves the two aspartate residues closer to carry out 
catalysis. Such an induced-fit mechanism might safeguard PSH, a 
membrane-embedded protease, against non-specific cleavages. Accord- 
ingly, we further speculate that the activated conformation of PSH may 
be stabilized by the presence of a substrate or inhibitor. Alternatively, 
although less likely, the extended distance between the two aspartates 
could be caused by crystal packing or other artefacts of in vitro manipu- 
lation. Regardless of these scenarios, the observed inactive conformation 
of PSH is apparently stable under our experimental conditions, and 
additional conformations of the presenilin-like protease may be cap- 
tured by future structural investigation. 
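The distinction drawn above between a coupled and an uncoupled catalytic dyad comes down to a distance comparison. The Python sketch below uses the two separations quoted in the text (6.7 Å for PSH and 3.1 Å for pepsin) with a hydrogen-bond cutoff of 3.5 Å; the cutoff value, the atom coordinates and the function name `dyad_state` are assumptions made for illustration only.

```python
import math

# Assumed upper limit for a hydrogen bond between carboxylate oxygens, in angstroms.
H_BOND_CUTOFF = 3.5


def dyad_state(atom_a, atom_b, cutoff=H_BOND_CUTOFF):
    """Return the inter-atomic distance and a coupled/uncoupled label."""
    distance = math.dist(atom_a, atom_b)
    label = "coupled" if distance <= cutoff else "uncoupled"
    return distance, label


# Hypothetical coordinates placed to reproduce the separations quoted in the text.
print(dyad_state((0.0, 0.0, 0.0), (6.7, 0.0, 0.0)))  # PSH-like separation
print(dyad_state((0.0, 0.0, 0.0), (3.1, 0.0, 0.0)))  # pepsin-like separation
```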


PSH and pepsin are coloured green and grey, respectively. Two water molecules
in pepsin are shown as red spheres. b, A close-up view of the active site
conformation in PSH. Four residues surrounding Asp 162 and Asp 220 are shown.

Cleavage of the peptide bond requires water. Access of water molecules
to the catalytic aspartates in PSH is unrestricted, due to the
presence of a large solvent cavity facing the cytoplasm (Fig. 1b). 
This feature is similar to that of rhomboid, where solvent access to 
the active site is provided by a cavity that opens to the extracellular 
side'”-*°. In S2P, however, water access is restricted in the closed 
conformation and only becomes profuse in the open conformation”. 

Analysis of the TM organization reveals two potential routes for 
substrate entry, through the open space either between TM6 and TM9 
or between TM6 and TM2 (Fig. 3b). The first possibility (between 
TM6 and TM9) is supported by experimental evidence. For PS1, TM9 
is thought to be directly involved in substrate binding**, and photoaffinity
labelling experiments suggest that the substrate entry site is
between NTF and CTF”. Structural consideration also favours this 
possibility, as Asp 162 and Asp 220 are both unobstructed in the 
case of substrate entry from between TM6 and TM9 (Supplemen- 
tary Fig. 10a). By contrast, there is considerable steric hindrance from 
Tyr 161 on TM6 and Met 216/Met 223 on TM7 if substrate proteins 
were to gain lateral access to the active site from the space between 
TM6 and TM2 (Fig. 3b and Supplementary Fig. 10b). This analysis, 
performed on the inactive conformation of PSH, indicates that sub- 
strate proteins are likely to enter the active site through the space 
between TM6 and TM9. We acknowledge the caveat that such analysis 
should be ideally carried out for the active protease conformation. 


Implication for human presenilin 1 

The primary sequences of PSH and PS1 share 19.3% identity and 
52.8% similarity (Supplementary Fig. 7). The highly conserved amino 
acids among PSH, PS1 and their homologues in Xenopus laevis 
and Drosophila melanogaster represent 48.2% of the total aligned 
sequences (Fig. 4a and Supplementary Fig. 7). Importantly, the sig- 
nature motifs for catalysis are conserved between PSH and PS1. There- 
fore, the structure of PS1 should be very similar to that of PSH, with 
observed key structural features in PSH likely to be preserved in PS1. 
On the basis of sequence alignment, we generated a membrane topo- 
logy diagram for PS1 (Fig. 4b) and built an energy-minimized atomic 


Figure 4 | Homology modelling of human presenilin 1. a, Amino acids that 
constitute the core of the PSH structure are highly conserved among PSH, 
human PS1, Xenopus laevis PS1 and Drosophila melanogaster PS1. The 
invariant and highly conserved amino acids in all four proteins are coloured red 
and yellow, respectively. b, A schematic diagram on the predicted membrane 
topology of PS1. Demarcations for all nine TMs are labelled. Catalytic 
aspartates are highlighted in red. Substrate recognition sequence PALP is
shown in magenta. c, A structure-based model of the transmembrane core of
human PS1. The atomic model of human PS1 transmembrane core was 
generated as described in Methods. Two perpendicular views are shown. 
Among the dozens of amino acids in PS1 that were targeted for mutations in 
neurodegenerative diseases, 12 representative residues, which are either 
invariant (red) or highly conserved (yellow), are shown here. 


3 JANUARY 2013 | VOL 493 | NATURE | 59 




model for PS1 (Fig. 4c). Comparison of our PS1 model with the NMR 
structure of PS1 CTF” reveals a shared overall topology but consider- 
able differences in TM7 and TM9 (Supplementary Fig. 11). Notably, 
the catalytic residue Asp 385 in the NMR structure, but not in our PS1 
model, points to the opposite direction of the other catalytic Asp 257 
on TM6. 
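Sequence identity values such as the one quoted for PSH and PS1 depend on how aligned positions are counted. The Python sketch below shows one common convention, in which identity is the fraction of identical residues among columns that carry no gap in either sequence. The source does not state which convention was used, so the denominator here is an assumption, and the toy alignment is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percentage of identical residues over gap-free alignment columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"
    ]
    if not columns:
        return 0.0
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


# Hypothetical aligned fragments (not the PSH/PS1 alignment).
print(round(percent_identity("MGD-GAL", "MGMGGAV"), 1))
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, which gives lower values for the same alignment.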

Greater than 50% of the disease-derived mutations in PS1 target 
highly conserved amino acids among PSH and PS1 homologues 
(Supplementary Fig. 7). We selected 12 representative mutations, 
which affect 8 invariant and 4 conserved amino acids between PSH 
and PS1, and mapped the affected residues onto the PS1 model 
(Fig. 4c). Next, we generated the corresponding mutations in PSH 
and examined the protease activities of these variants (Supplementary 
Fig. 12). These mutations are classified into three categories. The first 
category, including two mutations L108F and F142I (Supplementary 
Fig. 12, lanes 10-13), has little effect on protease activity. The second 
category decreases, but does not abolish, the protease activity of PSH. 
It includes eight mutations in PSH: L17P, N46D, I77P, V151E, L155S,
G219A, S225I and V261R (Supplementary Fig. 12, lanes 4-9, 14-20
and 25-30). The third category, including Y161S and V166F, com- 
pletely abrogated the protease activity of PSH (Supplementary Fig. 12, 
lanes 21-24). 

The effect of the majority of these mutations can be rationalized on 
the basis of PSH structure. For example, the first category mutations 
L108F and F142I, affecting residues in TM4 and TM5 of PSH, are far 
away from the active site and thus are unlikely to be involved in 
substrate binding or catalysis. The third category mutations Y161S 
and V166F affect two amino acids in the immediate vicinity of the 
catalytic residue Asp 162, probably causing pronounced alteration of 
the local conformation in the active site and thus crippling catalysis. 
Three mutations in the second category, including V151E and L155S 
in TM6, and S225I in TM7, may affect the protease activity through 
altered local conformation. 
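The three categories described above can be held in a small lookup table for further analysis. The Python sketch below encodes the twelve PSH mutations and their reported effect on protease activity as given in the text, and parses each label into wild-type residue, position and replacement. The helper names and the sequence-window query are hypothetical; proximity in sequence is used here only as a rough stand-in for proximity in the structure.

```python
import re

# Effect on PSH protease activity, as classified in the text.
EFFECT_BY_MUTATION = {
    **dict.fromkeys(["L108F", "F142I"], "little effect"),
    **dict.fromkeys(
        ["L17P", "N46D", "I77P", "V151E", "L155S", "G219A", "S225I", "V261R"],
        "reduced",
    ),
    **dict.fromkeys(["Y161S", "V166F"], "abolished"),
}

MUTATION_PATTERN = re.compile(r"([A-Z])(\d+)([A-Z])")


def parse_mutation(label):
    """Split a label such as 'S225I' into ('S', 225, 'I')."""
    match = MUTATION_PATTERN.fullmatch(label)
    if match is None:
        raise ValueError(f"not a point-mutation label: {label!r}")
    wild_type, position, replacement = match.groups()
    return wild_type, int(position), replacement


def mutations_near(position, window, table=EFFECT_BY_MUTATION):
    """Labels whose residue number lies within `window` of `position`."""
    nearby = [m for m in table if abs(parse_mutation(m)[1] - position) <= window]
    return sorted(nearby, key=lambda m: parse_mutation(m)[1])


# Mutations within five residues of the catalytic Asp 162.
for label in mutations_near(162, 5):
    print(label, EFFECT_BY_MUTATION[label])
```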


Perspective 


Although the structure of PS1 can be modelled on the basis of con- 
siderable sequence homology with PSH, their functional aspects share 
few common themes. γ-Secretase initially produces 48-49-residue
amyloid-β, followed by successive cleavages of 3-4 amino acids at
each step to produce a spectrum of peptides*’. The main effect of
mutations in PS1 is to increase the proportion of long, aggregation-prone
amyloid-β peptides, exemplified by the Aβ42/Aβ40 ratio*?.
These functional aspects cannot be recapitulated in PSH. In this
regard, the biochemical characterization of PSH has little implication
for PS1 or γ-secretase.

A fundamental issue in the investigation of the γ-secretase complex is 
how the four components are assembled. Our atomic model of PS1, 
together with available information from past studies, reveals insights 
into this important question. The TM4 of PS1 is thought to interact 
with the two TMs of PEN-2 (refs 12, 13). We took an ab initio struc- 
ture prediction approach to generate atomic models of PEN-2 
(Supplementary Fig. 13). One of the top scoring models was docked 
onto TM4 of the PS1 model, involving mostly hydrophobic interac- 
tions. Notably, the binding site for PEN-2 is on the opposite side of 
PS1-CTF, where APH-1 and nicastrin bind. We note that this docking 
model between PS1 and PEN-2 is highly speculative. 

The oligomeric state of γ-secretase has been controversial™, with
evidence for a complex containing one molecule for each of the four
components***’, or two molecules for PS1**”°, or with an aggregated
molecular mass of 900 kDa”. A recent electron microscopic study of
γ-secretase revealed an accurate molecular mass of 230 kDa, consistent
with the case of one molecule each for the four components”.
However, as is the case for membrane proteins in general, the oligomeric state
and the solution behaviour of γ-secretase are likely to depend on the
choice of solubilizing detergents. Given the high degree of sequence
similarity between PSH and PS1, it is tempting to speculate that PS1
may also form a homo-tetramer. Alternatively, the observed tetrameric
assembly of PSH could just be an artefact of in vitro manipulation and
crystallization, or more likely a unique property of PSH. Regardless of
these different scenarios, it is important to note that presenilin oligomerization
is not required for γ-secretase activity’.

Bacteria contain a family of membrane-embedded proteases, preflagellin
peptidase (PFP) and type-4 prepilin peptidase (TFPP).
Unlike presenilin/SPP, PFP and TFPP cleave substrate proteins in
an aqueous environment. The structure of the GXGD protease FlaK, a
PFP, was recently reported”®. Similar to PSH, the two catalytic Asp
residues are also uncoupled in FlaK. It should be noted, however, that there
is little sequence or structural similarity between PSH and FlaK
(Supplementary Fig. 14). Most notably, PSH, but not FlaK, shares
strong sequence homology with presenilin/SPP.

Although the focus of this manuscript is given to PS1, consider- 
able sequence similarity exists between PSH and SPP (Supplementary 
Fig. 15). The structural information of PSH, together with sequence 
alignment, should allow accurate modelling of SPP structure and 
perhaps rationalization of functional data on SPP. For example, the 
identification of the C-terminal TMs as a minimalist functional core 
in SPP* is consistent with the observed domain organization of PSH. 
In fact, the degree of sequence similarity between PSH and human 
SPP is approximately 44%, similar to that between PSH and PS1 
(Supplementary Fig. 15). Consequently, most of the structural fea- 
tures proposed for PS1 are likely to be conserved in human SPP as 
well. These features may include the overall TM organization and the 
locations of the active site aspartate residues. Intriguingly, electron 
microscopic analysis of human SPP reveals a tetrameric assembly” 
that is reminiscent of PSH. 

In summary, the structure of the aspartate intramembrane protease
PSH, the first of its class, provides important clues on the organization
of the nine TMs in PS1. Gratifyingly, our structural information is supported
by a body of published experimental evidence. For instance,
TM1 of PS1 was shown to participate in the catalytic core structure™; in
our structure, TM1 directly contacts TM7 and TM8 (Fig. 1b). In this
regard, our study also serves as a framework for understanding the
structure and mechanism of presenilin, γ-secretase and SPP.


METHODS SUMMARY 


The recombinant PSH protein was overexpressed in Escherichia coli and purified 
to homogeneity by affinity chromatography and gel filtration. The PSH was 
crystallized by the hanging-drop vapour-diffusion method. Derivative crystals 
were obtained by soaking crystals for 20 min in mother liquor containing 2 mM 
K₂PtCl₄. Diffraction data were collected at the Shanghai Synchrotron Radiation
Facility (SSRF) beamline BL17U and SPring-8 beamline BL41XU and processed
with HKL2000 (ref. 44). The experimental phases were generated by Pt-MAD
using SOLVE*. Multi-crystal averaging with all three available space groups
(C222, C222₁, P2) combined with solvent flattening, histogram matching and
non-crystallographic symmetry (NCS) averaging gave a map of sufficient quality 
for model building. An initial model was built manually using COOT”*. Sequence 
docking was aided with the selenium sites in the Se-SAD anomalous difference 
Fourier map. The structure was refined with PHENIX”. The proteolytic activity 
of PSH was examined by an in vitro cleavage assay using Gurken as the substrate. 
Sequence alignment was carried out using ClustalW**. The human presenilin 
1 (PS1) model was built according to the sequence alignment results with the 
Sculptor” from the PSH coordinates. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 


Protein engineering. The complementary DNAs of 13 archaeal presenilin 
homologues were individually cloned into pET-21b (Novagen). The recombinant 
proteins were overexpressed in E. coli strain BL21(DE3). After first-round screen- 
ing, the presenilin homologue from archaeon Methanoculleus marisnigri JR1 
(with an N-terminal 8-His tag), named as PSH, became our focused target 
because of its reasonable yield. However, the solution behaviour of the wild-type, 
full-length PSH is poor, and the purified protein is highly prone to aggregation 
and precipitation. We embarked on a systematic effort to improve the 
solution behaviour of PSH. Sequence alignment of PSH with other sequence 
homologues in bacteria and eukaryotes revealed a large number of amino acids 
that are conserved in other homologues, but not in PSH. These amino acids were 
targeted for individual mutation (by conserved amino acids) and the variants 
were evaluated for solution behaviour and protease activity. Only those mutations 
that improve solution behaviour but have little impact on protease activity were 
chosen for double, triple, and quadruple mutations. Using this strategy, we gene- 
rated more than 100 PSH variants. Many of these variants were crystallized. The 
variant that gave the best X-ray diffraction contains five mutations: D40N, E42S, 
A147E, V148P and A229V. 

Protein expression and purification. The transformed BL21(DE3) cells were
induced with 0.2 mM isopropyl-β-D-thiogalactopyranoside (IPTG) at an optical
density of 1.5 at 600 nm. After growing for 16 h at 22 °C, the cells were collected,
homogenized in a buffer containing 25 mM Tris-Cl, pH 8.0, and 150 mM NaCl,
and lysed using a French Press with 2 passes at 15,000 p.s.i. Cell debris was removed
by centrifugation. The supernatant was collected and subjected to ultracentrifugation
at 150,000g for 1 h. The membrane fraction was incubated with 2% (w/v)
n-nonyl-β-D-maltopyranoside (NM, Anatrace) for 1 h at 4 °C. Following another
ultracentrifugation step at 150,000g for 30 min, the supernatant was loaded onto
a Ni²⁺-NTA affinity column (Qiagen). The resin was washed with 25 mM Tris-Cl,
pH 8.0, 150 mM NaCl, 20 mM imidazole and 0.6% NM. The protein was eluted from
the affinity resin by 25 mM Tris-Cl, pH 8.0, 150 mM NaCl, 250 mM imidazole and
0.6% NM, and concentrated to about 20 mg ml⁻¹ before further purification by gel
filtration (Superdex-200, GE Healthcare). The buffer for gel filtration contained
25 mM Tris-Cl, pH 8.0, 150 mM NaCl, and 0.2% (w/v) n-nonyl-β-D-glucopyranoside
(β-NG, Anatrace) plus 0.023% (w/v) n-dodecyl-N,N-dimethylamine-N-oxide
(LDAO, Anatrace). All PSH variants were generated using two-step PCR and were
subcloned, overexpressed and purified in the same way as wild-type protein.
Selenomethionine (Se-Met)-labelled protein was purified similarly with the exception
that 2 mM Tris-[2-carboxyethyl]-phosphine (TCEP) was included during
purification.

The peak fraction of PSH from gel filtration was collected and incubated with 
the endoproteinase Glu-C from Staphylococcus aureus V8 (V8, Sigma-Aldrich) at 
22 °C for 20 min, with a final concentration of 0.03 mg ml⁻¹ for the V8 protease. 
After limited proteolysis, the resulting PSH protein was applied to gel filtration, 
which revealed two discrete fragments. Analysis by mass spectrometry revealed 
that an internal fragment in the extended loop connecting TM6 and TM7, resi- 
dues 182-209, was removed by the V8 protease treatment. The resulting two- 
chain PSH protein was used for crystallization. 

Crystallization. The concentration of protein for crystallization was approximately
6.5 mg ml⁻¹. Crystals were grown at 18 °C by the hanging-drop vapour-diffusion
method. The digested protein gave rise to crystals of trapezoid plates.
The crystals appeared in 3 days in the well buffer containing 0.1 M glycine pH 3.6,
0.2 M (NH₄)₂SO₄, 20% (w/v) PEG500MME (Fluka), 6% (w/v) glycerol and
0.04% (w/v) Anapoe-C,,Eg (Anatrace), and grew to full size in 3 weeks. These
crystals are in three different space groups: C222, C222₁ and P2. Crystals of the
C222 space group exhibited stronger diffraction than those of the other two space
groups. Derivative crystals were obtained by soaking crystals for 20 min in mother
liquor containing 2 mM K₂PtCl₄. Both native and heavy-atom-derived crystals
were directly flash-frozen in a cold nitrogen stream at 100 K.

Data collection. The high-resolution native data was collected at Shanghai 
Synchrotron Radiation Facility (SSRF) beamline BL17U. Other diffraction data 
were collected at the SPring-8 beamline BL41XU. The platinum-soaked crystals 
are extremely sensitive to radiation damage, even at 100 K. To collect a complete, 
high-redundancy, 3-wavelength MAD data set, the size of the collimated X-ray 
beam was fine-tuned to maximize the number of exposures on a single crystal. For 
the same crystal, the three anomalous data sets (peak, inflection and high-energy 
remote) around the absorption edge of platinum were collected at three different 
positions, each for a different wavelength. All data were processed with the 
package HKL2000 (ref. 44) by routine procedure. The diffraction images from 
the severely anisotropic native crystal were collected at the SSRF beamline BL17U, 
and integrated with DENZO™. Before the .x files were input into SCALEPACK* 
for merging and scaling, the anisotropic ellipsoidal truncations on the .x files were 
performed with the special version of the ellipsoidal truncation program provided
by the UCLA MBI - Diffraction Anisotropy Server”'. The applied resolution limits
along a*, b*, and c* directions are 3.32, 4.0, and 3.32 Å, respectively. No F/sigma
cut-off was performed at any resolution shell within the ellipsoid. Selection of the 
resolution range was based on the criteria of F/sigma larger than 3.0 (see 
Supplementary Fig. 3a). The pruned raw data was then used for the final structural 
refinement. Further processing was carried out using programs from the CCP4 
suites**. Data collection statistics are summarized in Supplementary Tables 1-3. 
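The geometric idea behind an anisotropic ellipsoidal truncation can be sketched in a few lines of Python. A reflection is kept when its reciprocal-space vector falls inside an ellipsoid whose semi-axes are the reciprocal resolution limits along a*, b* and c*. The sketch assumes an orthorhombic cell, so that the reciprocal axes are orthogonal, and uses the limits quoted above (3.32, 4.0 and 3.32 Å). The unit-cell lengths and the function name `inside_ellipsoid` are hypothetical, and this is not the implementation used by the Diffraction Anisotropy Server.

```python
def inside_ellipsoid(hkl, cell, limits):
    """True if reflection hkl lies within the resolution ellipsoid.

    hkl    -- Miller indices (h, k, l)
    cell   -- orthorhombic cell lengths (a, b, c) in angstroms
    limits -- resolution limits (d_a, d_b, d_c) in angstroms along a*, b*, c*

    The reciprocal-space components are h/a, k/b and l/c, and the ellipsoid
    semi-axes are 1/d_a, 1/d_b and 1/d_c.
    """
    h, k, l = hkl
    a, b, c = cell
    d_a, d_b, d_c = limits
    return (h * d_a / a) ** 2 + (k * d_b / b) ** 2 + (l * d_c / c) ** 2 <= 1.0


LIMITS = (3.32, 4.0, 3.32)      # from the text, in angstroms
TOY_CELL = (100.0, 120.0, 150.0)  # hypothetical orthorhombic cell lengths

for reflection in [(30, 0, 0), (0, 29, 0), (0, 31, 0)]:
    print(reflection, inside_ellipsoid(reflection, TOY_CELL, LIMITS))
```

With these toy cell lengths, a reflection along b* is cut at a lower resolution than one along a*, which is the intended effect for a crystal that diffracts more weakly in that direction.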

Structure determination. Six platinum atoms were determined using the program
SHELXD*. Each PSH molecule binds at least one platinum atom through
methionine residues. Four platinum atoms, one in each PSH molecule, display the
same non-crystallographic symmetry (NCS) as that of the four PSH molecules in
each asymmetric unit. The identified platinum sites were then refined and the
initial phases were calculated by the program SOLVE™ with the multi-wavelength
anomalous dispersion (MAD) phasing method. The figure of merit (FOM)
right after SOLVE was measurable only up to 5 Å (black line in Supplementary
Fig. 3b). The phases were refined and extended from 5 Å to 3.9 Å resolution by
fourfold NCS averaging, solvent flattening and
histogram matching using DM from the CCP4 program suite”. The resulting
electron density map displays all nine transmembrane helices. A crude partial
α-helical model was traced by hand, and the resulting model was then used for
molecular replacement with the program PHASER® into the C222₁ and P2 crystal
forms. Cross-crystal averaging with the three different crystal forms in Supplemen- 
tary Table 1 combined with solvent flattening, histogram matching, and NCS avera- 
ging in DMMulti® yielded a map of sufficient quality for model building. 

An initial model was built into the experimental electron density map using 
COOT**. Sequence docking was aided with the selenium sites in the anomalous 
difference Fourier map. The structure of the C222 space group was refined with 
PHENIX”, using the following restraints: stereochemistry (including bond 
length, angle, torsion angle and chiral volume, etc.), NCS, secondary structure 
restraints (377 defined hydrogen bonds in α-helices), and experimental phase
restraints. The refined model for one PSH molecule was used for molecular
replacement into the C222₁ and P2 crystal forms, and the C222₁ and P2 structures
were also refined using PHENIX” with the appropriate restraints. 

We used selenium anomalous signals to help validate the atomic model. There 
are 12 Met residues in the sequence of PSH, not counting the initiation Met. 
Except for one Met, all 11 other Met residues are located on six α-helices: TM1, TM2,
TM3, TM6, TM7 and TM8. In the selenium anomalous difference Fourier map,
all these 11 selenium sites were identified. However, there were no Met residues in
TM4, TM5, and TM9. To confirm the sequence assignment for these three TMs,
we generated three missense mutants of PSH: V114M on TM4, L133M on TM5,
and L284M on TM9, individually purified these three proteins to homogeneity,
and crystallized these variants. Single-wavelength anomalous diffraction data
from these crystals were collected at the absorption edge of selenium. Data collection
statistics are shown in Supplementary Table 3. The anomalous densities
around V114M, L133M and L284M in the anomalous difference Fourier map
were in complete agreement with the PSH atomic model (Supplementary Fig. 6a).

Protease activity assay. The proteolytic activity of PSH was examined by an in
vitro cleavage assay. The MBP-fused transmembrane region of Gurken was used as
the substrate’*. The assay was performed at 37 °C for 8 h in a buffer containing PBS,
0.02% (w/v) DDM and 20 mM citrate pH 5.3. The concentrations of the substrate
protein and protease were approximately 0.5 mg ml⁻¹.
N-[[[(2R,3S)-3-[[(1,1-dimethylethoxy)carbonyl]amino]-2-hydroxy-4-phenylbutyl](phenylmethyl)amino]carbonyl]-L-leucyl-L-valine
methyl ester WPE-III-31-C (abbreviated as
III-31-C, Sigma-Aldrich), a γ-secretase inhibitor”, was dissolved in DMSO and
used for the inhibition assay. The reaction was stopped by SDS sample buffer and the
cleavage products were analysed by SDS-PAGE and Coomassie staining.
Structural modelling of human presenilin 1 (PS1). Presenilin 1 sequences from 
Homo sapiens, Xenopus laevis and Drosophila melanogaster were aligned to their 
archaeal homologue (PSH) using the multiple sequence alignment algorithm 
implemented in ClustalW“. The human presenilin 1 (PS1) model was built 
from the PSH coordinates with Sculptor”, following the sequence alignment. 
The missing side chains were built manually in COOT”, and the 
missing loops were built with the program SLOOP”, using fragments from 
Richardson’s Top500 library of structures to fill the gaps. The model geometry 
was regularized with the program phenix.pdbtools -geometry_regularization”’. 
Structure prediction of PEN-2. The three-dimensional structure of PEN-2 was 
predicted by an ab initio method because there is no published structure of a 
similar sequence. In this method, the amino acid sequence, fragment structure 
templates, and the membrane environment were used as input. The three-residue 
and nine-residue fragments were generated from the ROBETTA fragment server 
(http://robetta.bakerlab.org/fragmentsubmit.jsp). The membrane environment 
was defined from the membrane normal and centre vectors derived from the 
trans-membrane region predicted by OCTOPUS”. The membrane 
ab initio modelling protocol® from Rosetta 3.3 was used with 
this information to generate 2,000 models. The models with high 
scores and good topologies were selected as candidate structures. 
Dwarf satellite galaxies are thought to be the remnants of the population 
of primordial structures that coalesced to form giant galaxies 
like the Milky Way’. It has previously been suspected’ that dwarf 
galaxies may not be isotropically distributed around our Galaxy, 
because several are correlated with streams of H I emission, and may 
form coplanar groups’. These suspicions are supported by recent 
analyses*”’. It has been claimed’ that the apparently planar distri- 
bution of satellites is not predicted within standard cosmology’, and 
cannot simply represent a memory of past coherent accretion. How- 
ever, other studies dispute this conclusion’"''. Here we report the 
existence of a planar subgroup of satellites in the Andromeda galaxy 
(M 31), comprising about half of the population. The structure is at 
least 400 kiloparsecs in diameter, but also extremely thin, with a 
perpendicular scatter of less than 14.1 kiloparsecs. Radial velocity 
measurements'’*’* reveal that the satellites in this structure have 
the same sense of rotation about their host. This shows conclusively 
that substantial numbers of dwarf satellite galaxies share the same 
dynamical orbital properties and direction of angular momentum. 
Intriguingly, the plane we identify is approximately aligned with the 
pole of the Milky Way’s disk and with the vector between the Milky 
Way and Andromeda. 

We undertook the Pan-Andromeda Archaeological Survey’® 
(PAndAS) to obtain a large-scale panorama of the halo of the Andro- 
meda galaxy (M 31), a view that is not available to us for the Milky 
Way. This Canada-France-Hawaii Telescope survey imaged about 
400 square degrees around M 31, which is the only giant galaxy in 
the Local Group besides our Milky Way. Stellar objects are detected 
out to a projected distance of about 150 kiloparsecs (kpc) from M 31, 
and about 50 kpc from M 33, the most massive satellite of M31. The 
data reveal a substantial population of dwarf spheroidal galaxies that 
accompany Andromeda”. 

The distances to the dwarf galaxies can be estimated by measuring 
the magnitude of the tip of the red-giant branch'*®. Improving on 
earlier methods, we have developed a Bayesian approach that yields 
the probability distribution function for the distance to each individual 
satellite’. In this way we now have access to homogeneous distance 
measurements (typical uncertainties 20-50 kpc) to the 27 dwarf gala- 
xies (filled circles in Fig. 1) visible within the PAndAS survey area”® 
that lie beyond the central 2.5°. 
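
As a rough illustration of how a tip magnitude maps to a distance, the sketch below applies the standard distance-modulus relation. It is not the Bayesian method described above. The I-band tip calibration of about −4.04 mag, the neglect of extinction, and the example apparent magnitude are all assumptions, not values from this Letter.

```python
def trgb_distance_kpc(m_tip, M_tip=-4.04):
    """Distance in kpc from the apparent magnitude of the red-giant-branch tip.

    M_tip is an assumed I-band calibration; extinction is ignored.
    """
    distance_modulus = m_tip - M_tip            # mu = m - M
    distance_pc = 10 ** (distance_modulus / 5 + 1)
    return distance_pc / 1000.0


# Hypothetical tip magnitude, chosen to land near the distance of M 31.
d = trgb_distance_kpc(20.42)
print(f"{d:.0f} kpc")
```

Under these assumptions, a shift of 0.1 mag in the tip changes the distance by roughly 5%, which is a few tens of kpc at this distance.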

In Fig. 2 these distance measurements are used to calculate the sky 
positions of the homogeneous sample of 27 dwarf galaxies as they 
would appear from the centre of the Andromeda galaxy. Visually, there 
appears to be a correlation close to a particular great circle (red line): 
this suggests that there is a plane, centred on M 31, around which a 
subsample of the satellites have very little scatter. This is confirmed by 
the Monte Carlo analysis presented in the Supplementary Informa- 
tion, where we show that the probability of the alignment of the sub- 
sample of n_sub = 15 satellites marked red in Figs 1 and 2 occurring at 
random is 0.13% (see Supplementary Fig. 1). 
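
The quantity at the heart of such an analysis is the root-mean-square distance of a set of satellites from a plane passing through the host galaxy. A minimal sketch is given below. The positions are made-up numbers, the function names are hypothetical, and the resampling of distances and the search over poles and subsamples are not reproduced.

```python
import math


def pole_to_unit_vector(l_deg, b_deg):
    """Unit vector for a pole at longitude l and latitude b (degrees)."""
    l, b = math.radians(l_deg), math.radians(b_deg)
    return (math.cos(b) * math.cos(l), math.cos(b) * math.sin(l), math.sin(b))


def rms_thickness(positions, l_deg, b_deg):
    """Root-mean-square distance of points from the plane through the
    origin (the host galaxy) whose normal points at (l, b)."""
    n = pole_to_unit_vector(l_deg, b_deg)
    offsets = [x * n[0] + y * n[1] + z * n[2] for x, y, z in positions]
    return math.sqrt(sum(d * d for d in offsets) / len(offsets))


# Hypothetical host-centred positions in kpc, scattered about the z = 0 plane.
sample = [(100.0, 0.0, 3.0), (-50.0, 80.0, -4.0), (0.0, -120.0, 0.0), (30.0, 40.0, 5.0)]
print(rms_thickness(sample, 0.0, 90.0))   # pole along +z
```

A best-fit plane would be found by minimising this quantity over the pole direction.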

Following this discovery, we sought to investigate whether the sub- 
sample displayed any kinematic coherence. The radial velocity of each 
satellite is shown in Fig. 3, corrected for the bulk motion of the 
Andromeda system towards us: what is immediately striking is that 
13 out of the 15 satellites possess coherent rotational motion, such that 
the southern satellites are approaching us with respect to M 31, while the 
northern satellites recede from us with respect to their host galaxy. 

The probability that 13 or more out of 15 objects should share the 
same sense of rotation is 1.4% (allowing for right-handed or left- 
handed rotation). Thus the kinematic information confirms the spatial 
correlation initially suspected from a visual inspection of Fig. 2. The 
total significance of the planar structure is approximately 99.998%. 
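
The quoted total significance is consistent with multiplying the spatial and kinematic chance probabilities, treated as independent. The Letter does not spell out how the two were combined, so the product below is an assumption.

```python
p_spatial = 0.0013      # chance alignment of the 15-satellite subsample (0.13%)
p_kinematic = 0.014     # chance that 13 or more of 15 share a sense of rotation (1.4%)

# Assumed combination: independent probabilities multiply.
p_both = p_spatial * p_kinematic
significance_percent = 100.0 * (1.0 - p_both)
print(f"{significance_percent:.3f}%")   # prints 99.998%
```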

Thus we conclude that we have detected, with very high confidence, 
a coherent planar structure of 13 satellites with a root-mean-square 
thickness of 12.6 ± 0.6 kpc (<14.1 kpc at 99% confidence), that corotate 
around M 31 with a (right-handed) axis of rotation that points approximately 
east. The three-dimensional configuration can be assessed 
visually in Fig. 3. The extent of the structure is gigantic, over 400 kpc 
along the line of sight and nearly 300 kpc north-to-south. Indeed, since 
Andromeda XIV and Cassiopeia II lie at the southern and northern 
limits of the PAndAS survey, respectively, it is quite probable that 
additional (faint) satellites belonging to this structure are waiting to 
be found just outside the PAndAS footprint. Although huge in extent, 
the structure appears to be lopsided, with most of the satellites popu- 
lating the side of the halo of M 31 that is nearest to the Milky Way. The 
completeness analysis we have undertaken shows that this configura- 
tion is not due to a lowered detection sensitivity at large distance, but 
reflects a true paucity of satellites in the more distant halo hemisphere”’. 

The existence of this structure had been hinted at in earlier work’, 
thanks to the planar alignment we report on here being viewed nearly 
edge-on, although the grouping could not be shown to be statistically 
significant from the information available at that time. Other previ- 
ously claimed alignments do not match the present plane, although 
they share some of the member galaxies: the most significant align- 
ment of ref. 23 has a pole 45° away from that found here, and the 
(tentative) poles of the configuration in ref. 4 are 23.4° away, whereas 
those of a later contribution® have poles 34.1° and 25.4° distant from 
ours. Without the increased sample size, reliable three-dimensional 
positions and radial velocities, and most importantly, a spatially 
unbiased selection function resulting from the homogeneous pano- 
ramic coverage of PAndAS, the nature, properties and conclusive 
statistical significance of the present structure could not be inferred. 
The present detection proves that in some giant galaxies, a signifi- 
cant fraction of the population of dwarf satellite galaxies, in this case 
around 50% (13 out of 27 over the homogeneously surveyed PAndAS 
area), are aligned in coherent planar structures, sharing the same 
direction of angular momentum. The Milky Way is the only other 
giant galaxy where we have access to high-quality three-dimensional 
positional data, and the existence of a similar structure around our 
Galaxy is strongly suggested by current data*>’. The implications for 
the origin and dynamical history of dwarf galaxies are profound. It also 
has a strong bearing on the analyses of dark matter in these darkest of 
galaxies, because one cannot now justifiably assume such objects to 
have evolved in dynamical isolation. 

Figure 1 | Map of the Andromeda satellite system. The homogeneous 
PAndAS survey (irregular polygon) provides the source catalogue for the 
detection and distance measurements of the 27 satellite galaxies” (filled circles) 
used in this study. Near M 31 (blue ellipse), the high background hampers the 
detection of new satellites and precludes reliable distance measurements for 
M 32 and NGC 205 (labelled black open circles); we therefore exclude the 
region inside 2.5° (dashed circle) from the analysis. The seven satellites known 
outside the PAndAS area (green circles and arrows) constitute a heterogeneous 
sample, discovered in various surveys with non-uniform spatial coverage, and 
their distances are not measured in the same homogeneous way. A reliable 
spatial analysis requires a data set with homogeneous selection criteria, so we do 
not include these objects in the sample either. The analysis shows that the 
satellites marked red are confined to a highly planar structure. We note that this 
structure is approximately perpendicular to lines of constant Galactic latitude, 
so it is aligned approximately perpendicular to the Milky Way’s disk 
(the grid squares are 4° x 4°). 

Figure 2 | Satellite galaxy positions as viewed from Andromeda. The Aitoff- 
Hammer projection shows the sample of 27 satellites” (filled circles from Fig. 1) 
as they would be seen from the centre of the Andromeda galaxy. In these 
coordinates the disk of Andromeda lies along the equator. ‘M 31-centric 
galactic latitude’ means what a fictitious observer in the M 31 galaxy would 
call ‘galactic latitude’. The background image represents the probability 
density function of the poles derived from 10° iterations of resampling the 27 
satellites from their distance probability density functions, and finding the 
plane of lowest root mean square from a subsample of 15 (the colour scale on 
the right shows the relative probability of the poles, and is dimensionless). A 
clear narrow peak at (l_M31 = 100.9° ± 0.9°, b_M31 = −38.2° ± 1.4°) 
highlights the small uncertainty in the best-fit plane. The solid red line, 
which passes within less than 1° of the position of the Milky Way 
(yellow circle labelled ‘MW’), represents the plane corresponding to this best 
pole location. 

Figure 3 | Three-dimensional view (online only in the PDF version) or two- 
dimensional screenshot (in the print version) of the planar, rotating 
structure. The coordinate system is such that the z direction is parallel to the 
vector pointing from the Milky Way to M 31, x increases eastwards and y 
northwards. Only the radial component of the velocity of each satellite is 
measured, and these velocities are shown as vectors pointing either towards or 
away from the Milky Way. As in Figs 1 and 2, red spheres mark the planar 
satellites, and blue spheres represent the ‘normal’ population. The coherent 
kinematic behaviour of the spatially very thin structure (red) is clearly apparent 
viewed from the y-z plane. With the exception of And XII and And XXVII, the 
satellites in the planar structure that lie to the north of M 31 recede from us, 
whereas those to the south approach us; this property strongly suggests 
rotation. Our velocity measurements'* (supplemented by values from the 
literature"), have very small uncertainties, typically <5 km s⁻¹. The irregular 
green polygon (visible only in the x-y plane of the three-dimensional online 
version) shows the PAndAS survey area, the white circle (visible only in the x-y 
plane of the three-dimensional online version) indicates a projected radius of 
150 kpc at the distance of M 31, and the white arrow (visible only in the three- 
dimensional online version) marks a velocity scale of 100 km s⁻¹ (And XXVII 
is not shown in this diagram because its most likely distance is 476 kpc behind 
M 31). This figure is three-dimensionally interactive in the online version 
(allowing the reader to change the magnification and viewing angle), and was 
constructed with the S2PLOT programming library”®. 

64 | NATURE | VOL 493 | 3 JANUARY 2013 

Intriguingly, the Milky Way lies within 1° of the plane reported here, 
the pole of the plane and the pole of the Milky Way’s disk are approxi- 
mately perpendicular (81°), and furthermore this plane is approximately 
perpendicular to the plane of satellites that has been proposed to sur- 
round the Galaxy (given that its pole points approximately towards 
Andromeda’ within the uncertainties). Although these alignments may 
be chance occurrences, they are nevertheless essential information about the 
structure of the nearby Universe that must be taken into account in 
future simulations aimed at modelling the dynamical formation history 
of the Local Group. 

The formation of this structure around M31 poses a puzzle. For 
discussion, we envisage two broad classes of possible explanations: 
accretion or in situ formation. In either type of model, the small scatter 
of the satellites out of the plane is difficult to explain, even though the 
orbital timescales for the satellites are long (around 5 Gyr for satellites 
at 150 kpc). All the galaxies in the plane are known to have old, evolved, 
stellar populations, and so in situ formation would additionally imply 
that the structure is ancient. 
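
The quoted orbital timescale can be checked with the period of a circular orbit, T = 2πr/v. In the sketch below the 150 kpc radius is from the text, whereas the 200 km/s orbital speed is an assumed, illustrative value that the Letter does not state.

```python
import math

KM_PER_KPC = 3.0857e16
SECONDS_PER_GYR = 3.156e16


def orbital_period_gyr(radius_kpc, speed_km_s):
    """Period of a circular orbit, T = 2*pi*r / v, in Gyr."""
    circumference_km = 2.0 * math.pi * radius_kpc * KM_PER_KPC
    return circumference_km / speed_km_s / SECONDS_PER_GYR


# 150 kpc is from the text; 200 km/s is an assumed orbital speed.
period = orbital_period_gyr(150.0, 200.0)
print(f"{period:.1f} Gyr")
```

With these inputs the period comes out between 4 and 5 Gyr, in line with the "around 5 Gyr" quoted above.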

In an accretion scenario, the dynamical coherence points to an 
origin in a single accretion of a group of dwarf galaxies. However, 
the spatial extent of the progenitor group would have to be broadly 
equal to or smaller than the current plane thickness (<14.1 kpc), yet no 
such groups are known. Interpreting the coherent rotation as a result 
of our viewing perspective~* requires a bulk tangential velocity for the 
in-falling group of the order of 1,000 km s⁻¹, which seems unphysically 
high. A further possibility is that we are witnessing accretion 
along filamentary structures that are fortuitously aligned. In situ for- 
mation may be possible if the planar satellite galaxies formed like tidal- 
dwarf galaxies in an ancient gas-rich galaxy merger’, but then the 
dwarf galaxies should be essentially devoid of dark matter. If the planar 
M 31 dwarfs are dynamically relaxed, the absence of dark matter would 
be greatly at odds with inferences from detailed observations” of Milky 
Way satellites, assuming the standard theory of gravity. An alternative 
possibility is that gas was accreted preferentially onto dark matter sub-haloes 
that were already orbiting in this particular plane, but then the 
origin of the plane of sub-haloes would still require explanation. We 
conclude that it remains to be seen whether galaxy formation models 
within the current cosmological framework can explain the existence 
of this vast, thin, rotating structure of dwarf galaxies within the halo of 
our nearest giant galactic neighbour. 
The nucleus of the Milky Way is known to harbour regions of 
intense star formation activity as well as a supermassive black hole’. 
Recent observations have revealed regions of γ-ray emission reaching 
far above and below the Galactic Centre (relative to the Galactic 
plane), the so-called ‘Fermi bubbles”. It is uncertain whether these 
were generated by nuclear star formation or by quasar-like out- 
bursts of the central black hole** and no information on the struc- 
tures’ magnetic field has been reported. Here we report observations 
of two giant, linearly polarized radio lobes, containing three ridge- 
like substructures, emanating from the Galactic Centre. The lobes 
each extend about 60 degrees in the Galactic bulge, closely corres- 
ponding to the Fermi bubbles, and are permeated by strong mag- 
netic fields of up to 15 microgauss. We conclude that the radio lobes 
originate in a biconical, star-formation-driven (rather than black- 
hole-driven) outflow from the Galaxy’s central 200 parsecs that 
transports a huge amount of magnetic energy, about 10⁵⁵ erg, into 
the Galactic halo. The ridges wind around this outflow and, we 
suggest, constitute a ‘phonographic’ record of nuclear star formation 
activity over at least ten million years. 

We use the images of the recently concluded S-band Polarization All 
Sky Survey (S-PASS) that has mapped the polarized radio emission of 
the entire southern sky. The survey used the Parkes Radio Telescope at 
a frequency of 2,307 MHz, with 184 MHz bandwidth, and 9’ angular 
resolution’. 

The lobes we report here exhibit diffuse polarized emission (Fig. 1), 
an integrated total intensity flux of 21 kJy, and a high polarization 
fraction of 25%. They trace the Fermi bubbles excepting the top western 
(that is, right) corners where they extend beyond the region covered by 
the γ-ray emission structure. Depolarization by H II regions establishes 
that the lobes are almost certainly associated with the Galactic Centre 
(Fig. 2 and Supplementary Information), implying that their height is 
~8 kpc. Archival data of WMAP* reveal the same structures at a 
microwave frequency of 23 GHz (Fig. 3). The 2.3-23 GHz spectral 
index α (with flux density S at frequency ν modelled as S_ν ∝ ν^α) of 
linearly polarized emission interior to the lobes spans the range −1.0 to 
−1.2, generally steepening with projected distance from the Galactic 
plane (see Supplementary Information). Along with the high polari- 
zation fraction, this phenomenology indicates that the lobes are due 
to cosmic-ray electrons, transported from the plane, synchrotron- 
radiating in a partly ordered magnetic field. 
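
For a power law in frequency, a two-point spectral index follows from the ratio of flux densities at the two frequencies. The sketch below uses hypothetical flux densities in arbitrary units; only the two frequencies are taken from the text.

```python
import math


def spectral_index(s1, nu1, s2, nu2):
    """Two-point spectral index alpha for S_nu proportional to nu**alpha."""
    return math.log(s2 / s1) / math.log(nu2 / nu1)


# Hypothetical flux densities (arbitrary units) at 2.3 GHz and 23 GHz.
alpha = spectral_index(s1=1.0, nu1=2.3e9, s2=0.08, nu2=23e9)
print(f"{alpha:.2f}")   # prints -1.10
```

Because the two frequencies differ by a factor of ten, the index here is simply the base-10 logarithm of the flux-density ratio.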

Three distinct emission ridges that all curve towards Galactic west 
with increasing Galactic latitude are visible within the lobes (Fig. 1); 
two other substructures proceeding roughly northwest and southwest 
from around the Galactic Centre hint at limb brightening in the bico- 
nical base of the lobes. These substructures all have counterparts in 
WMAP polarization maps (Fig. 3), and one of them’, already known 
from radio continuum data as the Galactic Centre spur’®, appears to 
connect back to the Galactic Centre; we label the other substructures 
the northern and southern ridges. The ridges’ magnetic field directions 
(Fig. 3) curve, following their structures. The Galactic Centre spur and 
southern ridge also seem to have GeV γ-ray counterparts (Fig. 2; also 
compare ref. 3). The two limb brightening spurs at the biconical lobe 
base are also visible in the WMAP map, where they appear to con- 
nect back to the Galactic Centre area. A possible third spur develops 
northeast from the Galactic Centre. These limb brightening spurs are 
also obvious in the Stokes U map as an X-shaped structure centred at 
the Galactic Centre (Supplementary Fig. 3). 

Such coincident, non-thermal radio, microwave and γ-ray emission 
indicates the presence of a non-thermal electron population covering 
at least the energy range 1-100 GeV (Fig. 4) that is simultaneously 
synchrotron-radiating at radio and microwave frequencies and 
upscattering ambient radiation into γ-rays by the inverse Compton 
process. The widths of the ridges are remarkably constant at ~300 pc 
over their lengths. The ridges have polarization fractions of 25-31% 
(see Supplementary Information), similar to the average over the lobes. 
Given this emission and the stated polarization fractions, we infer 
magnetic field intensities of 6-12 µG for the lobes and 13-15 µG for 
the ridges (see Figs 2 and 3, and Supplementary Information). 
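
The upper part of the quoted electron energy range can be checked with the textbook Thomson-regime relation for the mean inverse Compton photon energy, (4/3)γ²ε. The 1 eV seed photon energy and the 1 GeV and 50 GeV photon energies are those quoted in the Fig. 4 caption. Whether the authors estimated the energies this way is an assumption.

```python
import math

ELECTRON_REST_ENERGY_EV = 0.511e6


def electron_energy_for_ic_gev(gamma_ray_ev, seed_photon_ev):
    """Electron energy (GeV) whose mean inverse Compton photon energy,
    (4/3) * gamma**2 * seed energy (Thomson regime), equals gamma_ray_ev."""
    lorentz = math.sqrt(0.75 * gamma_ray_ev / seed_photon_ev)
    return lorentz * ELECTRON_REST_ENERGY_EV / 1e9


for e_gamma_ev in (1e9, 50e9):
    e_electron = electron_energy_for_ic_gev(e_gamma_ev, 1.0)
    print(f"{e_gamma_ev / 1e9:.0f} GeV photon <- {e_electron:.0f} GeV electron")
```

Under this approximation the two photon energies correspond to electrons of roughly 14 GeV and 99 GeV, inside the 1-100 GeV range stated above.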

An important question about the Fermi bubbles is whether they are 
ultimately powered by star formation or by activity of the Galaxy’s 
central, supermassive black hole. Despite their very large extent, the 
γ-ray bubbles and the X-shaped polarized microwave and X-ray structures 
tracing their limb-brightened base" have a narrow waist of 
only 100-200 pc diameter at the Galactic Centre. This matches the 
extent of the star-forming molecular gas ring (of ~3 × 10⁷ solar 
masses) recently demonstrated to occupy the region’”. With 5-10% 
of the Galaxy’s molecular gas content’, star-formation activity in this 
‘central molecular zone’ is intense, accelerating a distinct cosmic ray 
population’*”* and driving an outflow’* of hot, thermal plasma, 
cosmic rays and ‘frozen-in’ magnetic field lines®!*"°. 

One consequence of the region’s outflow is that the cosmic ray 
electrons accelerated there (dominantly energized by supernovae) are 
advected away before they lose much energy radiatively in situ'*'°"”. 
This is revealed by the fact that the radio continuum flux on scales up 
to 800 pc around the Galactic Centre is in anomalous deficit with 
respect to the expectation afforded by the empirical far-infrared/radio 
continuum correlation’’. The total 2.3 GHz radio continuum flux 
from the lobes of ~21 kJy, however, saturates this correlation as normalized 
to the 60 µm flux (2 MJy) of the inner ~160 pc diameter 
region (ref. 19). Together with the morphological evidence, this 
strongly indicates that the lobes are illuminated by cosmic ray elec- 
trons accelerated in association with star formation within this region 
Figure 1 | Linearly polarized intensity P at 2.3 GHz from S-PASS. The thick 
dashed lines delineate the radio lobes reported in this Letter, while the thin 
dashed lines delimit the γ-ray Fermi bubbles*. The map is in Galactic 
coordinates, centred at the Galactic Centre with Galactic east to the left and 
Galactic north up; the Galactic plane runs horizontally across the centre of the 
map. The linearly polarized intensity flux density P (a function of the Stokes 
parameters Q and U, P = √(Q² + U²)) is indicated by the colour scale, and given 
in units of Jy per beam with a beam size of 10.75′ (1 Jy = 10⁻²⁶ W m⁻² Hz⁻¹). 
The lobes’ edges follow the γ-ray border up to Galactic latitude |b| ~ 30°, from 
which the radio emission extends. The three polarized radio ridges discussed in 
the text are also indicated, along with the two limb brightening spurs. The 


(see Supplementary Information), and that the lobes are not a result of 
black hole activity. 

The ridges appear to be continuous windings of individual, colli- 
mated structures around a general biconical outflow out of the Galactic 
Centre. The sense of Galactic rotation (clockwise as seen from Galactic 
north) and angular-momentum conservation mean that the ridges 
get ‘wound up” in the outflow with increasing distance from the 
plane, explaining the projected curvature of the visible, front-side 
of the ridges towards Galactic west. Polarized, rear-side emission is 
attenuated, rendering it difficult to detect against the stronger emission 
from the lobes’ front-side and the Galactic plane (Fig. 1 and Sup- 
plementary Information). 

For cosmic ray electrons synchrotron-emitting at 2.3 GHz to be able 
to ascend to the top of the northern ridge at ~7 kpc in the time it takes 
them to cool (mostly via synchrotron emission itself) requires vertical 
transport speeds of >500 km s⁻¹ (for a field of 15 µG; see Fig. 4). Given 
the geometry of the Galactic Centre spur, the outflowing plasma is 
moving at 1,000-1,100 km s⁻¹ (Fig. 4 and Supplementary Information), 
somewhat faster than the ~900 km s⁻¹ gravitational escape 
velocity from the Galactic Centre region’, implying that 2.3-GHz- 
radiating electrons can, indeed, be advected to the top of the ridges 
before they lose all their energy. 
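
The order of magnitude of the required transport speed can be reproduced from standard synchrotron formulae. The sketch below is a simplified estimate and not necessarily the authors' calculation: it assumes synchrotron losses only, a field perpendicular to the electron velocity (pitch angle of 90°), and standard CGS constants. The 15 µG field, 2.3 GHz frequency and 7 kpc height are from the text.

```python
import math

# CGS constants
E_CHARGE = 4.8032e-10      # esu
M_E = 9.1094e-28           # g
C = 2.9979e10              # cm/s
SIGMA_T = 6.6524e-25       # cm^2
KM_PER_KPC = 3.0857e16
SECONDS_PER_MYR = 3.156e13


def lorentz_factor_for_frequency(nu_hz, b_gauss):
    """Lorentz factor of electrons whose synchrotron critical frequency is
    nu_hz, with nu_c = (3/2) * gamma**2 * e*B / (2*pi*m_e*c)."""
    gyro_hz = E_CHARGE * b_gauss / (2.0 * math.pi * M_E * C)
    return math.sqrt(nu_hz / (1.5 * gyro_hz))


def synchrotron_cooling_time_s(gamma, b_gauss):
    """E / (dE/dt) with dE/dt = 2 * sigma_T * c * gamma**2 * B**2 / (8*pi),
    that is, a 90-degree pitch angle and synchrotron losses only."""
    return 4.0 * math.pi * M_E * C / (SIGMA_T * gamma * b_gauss ** 2)


b = 15e-6                                    # 15 microgauss, from the text
gamma = lorentz_factor_for_frequency(2.3e9, b)
t_cool = synchrotron_cooling_time_s(gamma, b)
speed_km_s = 7.0 * KM_PER_KPC / t_cool       # 7 kpc ridge height, from the text
print(f"E ~ {gamma * 0.511e-3:.1f} GeV, "
      f"t_cool ~ {t_cool / SECONDS_PER_MYR:.0f} Myr, "
      f"v > {speed_km_s:.0f} km/s")
```

With these assumptions the electrons have energies of about 3 GeV and cool in roughly 12 Myr, which gives a minimum speed of several hundred km/s, the same order as the >500 km s⁻¹ quoted above. Averaging over pitch angles or adding inverse Compton losses would change the number.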


ridges appear to be the front side of a continuous winding of collimated 
structures around the general biconical outflow of the lobes (see text). The 
Galactic Centre spur is nearly vertical at low latitude, possibly explained by a 
projection effect if it is mostly at the front of the northern lobe. At its higher 
latitudes, the Galactic Centre spur becomes roughly parallel with the northern 
ridge (above), which itself exhibits little curvature; this is consistent with the 
overall outflows becoming cylindrical above 4-5 kpc as previously suggested’’. 
In such a geometry, synchrotron emission from the rear side of each cone is 
attenuated by a factor ≳2 with respect to the front side, rendering it difficult to 
detect the former against the foreground of the latter and of the Galactic plane 
(see Supplementary Information). 


Given the calculated fields and the speed of the outflow, the total 
magnetic energy for each of the ridges, (4-9) × 10⁵² erg (see Sup- 
plementary Information), is injected at a rate of ~10³⁹ erg s⁻¹ over a 
few million years; this is very close to the rate at which independent 
modelling® suggests Galactic Centre star formation is injecting mag- 
netic energy into the region’s outflow. On the basis of the ridges’ indi- 
vidual energetics, geometry, outflow velocity, timescales and plasma 
content (see Supplementary Information), we suggest that their foot- 
points are energized by and rotate with the super-stellar clusters inha- 
biting’ the inner ~ 100 pc (in radius) of the Galaxy. In fact, we suggest 
that the ridges constitute ‘phonographic’ recordings of the past 
~10 Myr of Galactic Centre star formation. Given its morphology, 
the Galactic Centre spur probably still has an active footpoint. In contrast, 
the northern and southern ridges seem not to connect to the 
plane at 2.3 GHz. This may indicate their footpoints are no longer 
active, though the southern ridge may be connected to the plane by 
a γ-ray counterpart (see Fig. 2). Unfortunately, present data do not 
allow us to trace the Galactic Centre spur all the way down to the 
plane, but a connection is plausible between this structure and one 
(or some combination) of the ~1°-scale radio continuum spurs'>”* 
emanating north of the star-forming giant molecular cloud complexes 
Sagittarius B and C; a connection is also plausible with the bright, 
non-thermal ‘radio arc’! (itself longitudinally coincident with the 
~4-Myr-old Quintuplet” stellar cluster). 

The magnetic energy content of both lobes is much larger than that of the 
ridges, (1-3) × 10⁵⁵ erg. This suggests the magnetic fields of the lobes 
are the result of the accumulation of a number of star formation 
episodes. Alternatively, if the lobes’ field structure were formed over 
the same timescale as the ridges, it would have to be associated with 


Figure 2 | Lobes’ polarized intensity and γ-ray spurs. Schematic rendering of 
the edges of two γ-ray substructures evident in the 2-5 GeV Fermi data as 
displayed in figure 2 of ref. 2, which seem to be counterparts of the Galactic 
Centre spur and the southern ridge. The map is in Galactic coordinates, with 
Galactic east to the left and Galactic north up; the Galactic plane runs 
approximately horizontally across the centre of the map. The linearly polarized 
intensity flux density P is indicated by the colour scale, and given in units of Jy 
per beam with a beam size of 10.75′. The southern ridge appears to be connected 
to the Galactic Centre by its γ-ray counterpart. With the flux densities and 
polarization fraction quoted in the text, we can infer equipartition’® magnetic 
field intensities of B_eq ~ 6 µG (1 µG = 10⁻¹⁰ T) if the synchrotron-emitting 
electrons occupy the entire volume of the lobes, or ~12 µG if they occupy only a 
300-pc-thick skin (the width of the ridges). For the southern ridge, B_eq ~ 13 µG; 
for the Galactic Centre spur, B_eq ~ 15 µG; and, for the northern ridge, 
B_eq ~ 14 µG. Note the large area of depolarization and small-angular-scale 
signal modulation visible across the Galactic plane extending up to |b| ~ 10° 
on either side of the Galactic Centre (thin dashed line). This depolarization is 
due to Faraday rotation by a number of shells that match Hα emission regions”, 
most of them lying in the Sagittarius arm at distances from the Sun up to 
2.5 kpc, and some in the Scutum-Centaurus arm at ~3.5 kpc. The small-scale 
modulation is associated with weaker Hα emission encompassing the same 
H II regions and most probably associated with the same spiral arms. Thus 
2.5 kpc constitutes a lower limit to the lobes’ near-side distance and places 
the far side beyond 5.5 kpc from the Sun (compare ref. 9). Along with their 
direction in the sky, this suggests that the lobes are associated with the Galactic 
bulge and/or Centre. 


recent activity of the supermassive black hole, perhaps occurring in 
concert with enhanced nuclear star-formation activity*. 

Our data indicate that the process of gas accretion onto the Galactic 
nucleus inescapably involves star formation which, in turn, energizes 
an outflow. This carries away low-angular-momentum gas, cosmic 
rays and magnetic field lines, and has a number of important conse- 
quences. First, the dynamo activity in the Galactic Centre”, probably 
required to generate its strong” in situ field, requires the continual 
expulsion of small-scale helical fields to prevent dynamo saturation”; 
the presence of the ridges high in the halo may attest to this process. 
Second, the lobes and ridges reveal how the very active star formation 
in the Galactic Centre generates and sustains a strong, large-scale 
magnetic field structure in the Galactic halo. The effect of this on 
Figure 3 | Polarized intensity and magnetic angles at 23 GHz from 
WMAP*. The magnetic angle is orthogonal to the emission polarization angle 
and traces the magnetic field direction projected on to the plane of the sky 
(headless vector lines). The three ridges are obvious while traces of the radio 
lobes are visible (2.3 GHz edges shown by the black solid line). The magnetic 
field is aligned with the ridges and curves following their shape. Two spurs 
match the lobe edges northwest and southwest of Galactic Centre and could be 
limb brightening of the lobes. A third limb brightening spur candidate is also 
visible northeast of the Galactic Centre. The map is in Galactic coordinates, 
centred at the Galactic Centre. Grid lines are spaced by 15°. The emission 
intensity is plotted as brightness temperature, in K. The vector line length is 
proportional to the polarized brightness temperature (the scale is shown by the 
line in the bottom-left corner, in K). Data have been binned in 1° × 1° pixels to 
improve the signal-to-noise ratio. From a combined analysis of microwave and 
y-ray data (see also Supplementary Information) we can derive the following 
magnetic field limits (complementary to the equipartition limits reported in the 
text and Fig. 2): for the overall lobes/bubbles, B > 9 µG; and for the Galactic 
Centre spur, 11 µG < B < 18 µG. 
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Figure 4 | The vertical range of cosmic ray electrons as a function of their 
kinetic energy, E.. Two cases are reported, for field amplitudes of 15 and 6 1G 
(blue and red curves, respectively). Owing to geometrical uncertainties, adiabatic 
losses cannot be determined so the plotted range (y axis) actually constitutes an 
upper limit. Electrons are taken to be transported with a speed given by the sum 
of the inferred vertical wind speed (1,100 km s⁻¹) and the vertical component 
of the Alfvén velocity in the magnetic field. The former is inferred from the 
geometry of the northern ridge: if its footpoint has executed roughly half an 
orbit in the time the Galactic Centre spur has ascended to its total height of 
~4 kpc, its upward velocity must be close to 1,000 km s⁻¹ × (r/100 pc)⁻¹ × 
v_rot/(80 km s⁻¹), where we have normalized to a footpoint rotation speed of 
80 km s⁻¹ at a radius of 100 pc from the Galactic Centre (detailed analysis 
gives 1,100 km s⁻¹: see Supplementary Information). In a strong, regular 
magnetic field, the electrons are expected to stream ahead of the gas at the 
Alfvén velocity in either the ridges (B ~ 15 μG, v_A ~ 300 km s⁻¹; this is a 
lower limit given that n_H < 0.008 cm⁻³ on the basis of the ROSAT data) or in 
the large-scale field of the lobes (B ~ 6 μG, v_A ≳ 100 km s⁻¹ for 
n_H ≲ 0.004 cm⁻³ in the lobes' interior as again implied by the data). Also 
plotted as the vertical dashed lines are the characteristic energies of electrons 
synchrotron radiating at 2.3 and 23 GHz (for a 15 μG field) and into 1-GeV and 
50-GeV γ-rays via inverse Compton ('IC') upscattering of a photon 
background with characteristic photon energy 1 eV; and the approximate 7 kpc 
distance of the top of the northern ridge from the Galactic plane. 
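
The half-orbit argument in the caption above can be checked with a few lines of arithmetic. The sketch below is only an illustration under the caption's stated normalization (a 4 kpc ascent, a footpoint at 100 pc rotating at 80 km s⁻¹); the function name and the assumption of a circular footpoint orbit are ours, not the authors'.

```python
import math

PC_IN_KM = 3.0857e13  # kilometres per parsec


def upward_speed_km_s(height_pc, radius_pc, v_rot_km_s, orbit_fraction=0.5):
    """Mean ascent speed if `height_pc` is reached while the footpoint
    completes `orbit_fraction` of a circular orbit of radius `radius_pc`
    (hypothetical helper; circular orbit is an assumption)."""
    arc_km = orbit_fraction * 2.0 * math.pi * radius_pc * PC_IN_KM
    elapsed_s = arc_km / v_rot_km_s
    return height_pc * PC_IN_KM / elapsed_s


# Values quoted in the caption: ~4 kpc height, 100 pc radius, 80 km/s rotation.
v_up = upward_speed_km_s(height_pc=4000.0, radius_pc=100.0, v_rot_km_s=80.0)
print(f"upward speed ~ {v_up:.0f} km/s")
```

The result lands close to the 1,000 km s⁻¹ normalization in the caption and scales as v_rot/r, as the quoted expression requires.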


the propagation of high-energy cosmic rays in the Galactic halo should 
be considered. Third, the process of gas expulsion in the outflow may 
explain how the Milky Way’s supermassive black hole is kept relatively 
quiescent’, despite sustained, inward movement of gas. 
Optical-field-induced current in dielectrics 
The time it takes to switch on and off electric current determines 
the rate at which signals can be processed and sampled in modern 
information technology. Field-effect transistors are able 
to control currents at frequencies of the order of or higher than 
100 gigahertz, but electric interconnects may hamper progress 
towards reaching the terahertz (10¹² hertz) range. All-optical injection 
of currents through interfering photoexcitation pathways 
or photoconductive switching of terahertz transients has made 
it possible to control electric current on a subpicosecond timescale 
in semiconductors. Insulators have been deemed unsuitable 
for both methods, because of the need for either ultraviolet light 
or strong fields, which induce slow damage or ultrafast breakdown, 
respectively. Here we report the feasibility of electric 
signal manipulation in a dielectric. A few-cycle optical waveform 
reversibly increases, free from breakdown, the a.c. conductivity 
of amorphous silicon dioxide (fused silica) by more than 18 orders 
of magnitude within 1 femtosecond, allowing electric currents to 
be driven, directed and switched by the instantaneous light field. 
Our work opens the way to extending electronic signal processing 
and high-speed metrology into the petahertz (10¹⁵ hertz) domain. 

Three basic types of solid—metals, semiconductors and dielec- 
trics—fundamentally differ in their reaction to an applied electric field. 
Metals respond to a small field with a current linearly proportional to 
it. This implies a strong screening that prevents high fields and charge 
density gradients from forming inside metals, which makes it 
fundamentally difficult to control their responses. By contrast, 
semiconductors, in which there is a relatively small but non-zero energy gap, 
Δ_g, between the valence and conduction bands, allow electric fields to 
penetrate and charge gradients to build up. This forms the basis of 
contemporary digital electronics. Dielectrics, with their large 
bandgap and conduction and valence band widths, offer the fastest 
response. However, because of their reaction to applied fields (that is, 
extremely low conductivity at low fields and breakdown at high 
fields), they have been thought to be unsuitable for electronic 
signal processing in field-effect devices. Here we demonstrate that a 
strong, few-cycle optical field is capable of transforming the dielectric 
into a state of highly increased polarizability, allowing optical currents 
to flow and resulting in macroscopic charge separation that is detect- 
able in an external circuit. 

In our experiments, we exposed a fused silica sample (Δ_g ~ 9 eV and 
conduction band width Δ_c ~ 10 eV) to a strong, waveform-controlled, 
few-cycle field F_i(t) with a photon energy of ħω_L ~ 1.7 eV (where ħ 
denotes Planck's constant divided by 2π and ω_L is the carrier angular 
frequency of the optical field), which transforms the dielectric into an 
optically conducting state, that is, injects carriers. We therefore refer to 
this field as the injection field. By carriers, we mean electrons in highly 
polarizable states at optical frequencies (rather than ones capable of con- 
ducting d.c. current, requiring delocalized electrons). Two unbiased gold 
electrodes collect the charge separated by the optical-field-induced current 
(Fig. 1a, Methods Summary and Supplementary Information). First, the 


applied external field F_i(t) = F_0 f(t) cos(ω_L t + φ_CE), with a controlled 
carrier-envelope phase (CEP) φ_CE, a sub-4-fs envelope f(t) with 
f(0) = 1, and F_0 ~ 1.7 V Å⁻¹, was polarized perpendicularly to the 
metal-dielectric interface, along coordinate x, to drive the generated carriers 
towards the electrodes (Fig. 1a). Fig. 2 plots Q_P, the charge transferred 
through the ammeter shown in Fig. 1 during exposure to a single laser 
pulse, as a function of the change in φ_CE (Fig. 2a) and as a function of F_0 
(Fig. 2b). The CEP of the laser pulse is shifted by Δφ_CE on changing the 
Figure 1 | Optical-field-induced conductivity and current control in a 
dielectric. a, Schematic of the metal-dielectric nanojunction. b, Schematic 
illustration of the adiabatic energy levels of the electronic states in the valence 
band (VB, purple) and conduction band (CB, green) of the dielectric under the 
influence of a static or slowly varying strong electric field. The eigenenergies fan 
out as the strength of the electric field increases, resulting in avoided crossings, 
that is, anticrossings (inset). At low fields, the valence band states are occupied, 
and the conduction band states are empty. As the field strength increases, the 
valence band and conduction band levels cross, but the respective 
Wannier-Stark states are localized at distant sites, and the anticrossings are passed 
diabatically (that is, the conduction band states remain unpopulated) until the 
field approaches or exceeds ~1 V Å⁻¹. At these field strengths, electrons may 
be promoted into the conduction band via Zener tunnelling, leaving the 
electron in the lower-energy state after the passage of the anticrossing (adiabatic 
transition, depicted by dashed arrow). The resultant unoccupied valence band 
states mediate strong single-photon resonances at visible/near-infrared angular 
frequencies ω_NIR/vis within the valence band (red arrow). The emergence of 
these resonances results in a strong transient polarizability. 
Figure 2 | Carrier-envelope-phase control and intensity dependence of 
optical-field-generated electric current in SiO₂. a, Plot of the φ_CE-dependent 
component of Q_P(Δl) against the change, Δl, in the propagation length through 
a fused silica wedge with respect to the propagation length yielding the 
minimum pulse duration (Δl = 0), for polarizations perpendicular (along x; 
squares) and parallel (in the y-z plane; circles) to the metal-dielectric interface 
(F_0 ~ 1.7 V Å⁻¹). The data represent an average of several consecutively 
acquired values at a given value of Δl. Error bars show the standard deviation, 
and most are smaller than the size of the symbols representing the mean values 
of the data. The black dashed line shows the prediction of our quantum 
mechanical model. For the parallel polarization, the signal is suppressed by 
more than an order of magnitude. The residual signal is attributed to 
microscopic imperfections of the macroscopically plane metal-dielectric 
interface, which result in locations with a non-zero perpendicular component 
of the field. The red dashed line (right axis) depicts the change in pulse duration 
τ_p (full width at half maximum intensity) as a function of Δl, taking into 
account the group velocity dispersion of the visible/near-infrared pulse in the 
fused silica wedge. b, Plot of the maximum amplitude, |Q_P(Δl = 0)|, of the 
transferred charge against the peak amplitude, F_0, of the applied external field 
polarized along x: measurement (squares) and theoretical prediction (dashed 
line). For a given F_0 value, |Q_P(Δl = 0)| is determined by fitting the most 
pronounced oscillation of Q_P(Δl) (a, squares) with a sine function. The vertical 
error bars account for the standard deviation of this fit. Different values of F_0 
correspond to different beam sizes. Data points have been normalized 
accordingly. F_0 is determined by monitoring the pulse energy and the laser 
beam waist at the focus, and the horizontal error bars quantify random 
fluctuations in these parameters. I_0 is the corresponding peak intensity of the 
optical pulse. Data from our model have been multiplied by the effective 
cross-section, A_eff, of the metal-dielectric interfaces confining the active volume of 
the dielectric (main text). 


propagation length by Δl in a pair of thin, fused silica wedges (Methods 
Summary and Supplementary Fig. 1). 

We find that Q_P varies quasi-periodically with Δl. Its minimum at 
Δl = 0 turns into a maximum at Δl = ±Δl_deph, with Δl_deph = 23 μm 
shifting the CEP by Δφ_CE = π, which results in an approximate 
inversion of the optical waveform: F_i(t, Δl_deph) ~ −F_i(t, 0). This reversal of 
Q_P and its order-of-magnitude reduction when F_i(t) is polarized 
parallel to the metal-dielectric interface (that is, in the y-z plane) provide 
the first significant indication that this current is induced directly by 
the instantaneous light field. The maximum value of Q_P scales nearly 
exponentially with the field amplitude and reaches ~10⁴ electrons 
before breakdown (Fig. 2b). This nonlinear dependence is responsible 
for the decay of Q_P(Δl) with increasing Δl, as a consequence of 
dispersive pulse broadening. 

To decouple the injection and driving processes, we irradiated the 
hybrid junction with two synchronized, orthogonally polarized fields. 
An injection field with a peak amplitude of F_0^(i) ~ 2 V Å⁻¹ was 
polarized parallel to the electrode-insulator interface (that is, in the y-z 
plane) to prevent it from driving current through the circuit. This 
driving was accomplished by a weaker replica of the same field, the drive 
field F_d(t), which had an amplitude of F_0^(d) ~ 0.2 V Å⁻¹ (Fig. 1a). The 
delay, Δt, between the peaks of F_i(t) and F_d(t) determines the timing of 
carrier injection with respect to the drive field and thereby controls 
the momentum that F_d(t) transfers to the charge carriers. Fig. 3a shows 
Q_P versus Δt for the CEPs of the injection (φ_CE^(i)) and drive (φ_CE^(d)) pulses 
corresponding to one of the zero-crossings of Q_P near Δl = 0 in Fig. 2a. 
Such values of φ_CE^(i) and φ_CE^(d) are chosen to avoid a residual (background) 
current originating in any of the fields independently (Fig. 2a). 

The transferred charge periodically changes its sign as a function of 
Δt, revealing two major current reversal cycles equal to the field 
oscillation cycle of F_d(t) (Fig. 3a). The magnitude of the signal rapidly 
decays outside this few-femtosecond delay interval. Fig. 3b plots the 
result of the same measurement performed with an 'inverted' drive 
field, in which the field oscillations are reversed by shifting the CEP by 
Δφ_CE^(d) = π. For any value of Δt, the transferred charge is reversed with 
respect to its value at the same delay in Fig. 3a. This is exactly the 
behaviour we intuitively expect for a current driven directly by F_d(t). 
The observed Q_P(Δt) reflecting the oscillating behaviour of F_d(t) 
provides compelling evidence that the light field governs the current that 
emerges from the dielectric medium and is measured in the external 
circuit. The transferred charge was also found to depend sensitively on 
the CEP of the injection field, φ_CE^(i) (Supplementary Fig. 7). This also 
corroborates the evidence that carrier injection is induced directly by 
the instantaneous field F_i(t) rather than by effects governed by the 
cycle-averaged intensity. Experiments on free-standing electrodes kept 
under helium flux did not yield any φ_CE-dependent electronic signal 
under identical irradiation conditions (Supplementary Fig. 10), and so 
we identify the insulator as the primary source of current in our 
experiments. 

Fig. 3c plots the electric field waveform of the drive pulse retrieved 
from attosecond streaking’. Such streaking experiments were per- 
formed in a parallel ultrahigh vacuum set-up independently from 
the light-field-induced current measurements and under identical 
laser conditions. The good correspondence between the solid lines in 
Fig. 3a and Fig. 3c suggests that the injected carriers sample the optical 
field with good fidelity. Carrier injection must therefore be substan- 
tially confined to a half-period, that is, to a time window of ~1 fs. This 
confinement is also indicative of a strong nonlinearity, in accordance 
with Fig. 2b. 

To describe our observations theoretically, we calculate the current 
density, j_x(t), generated in the bulk insulator along coordinate x by 
solving the time-dependent Schrödinger equation for a dielectric film 
with a thickness in the range 50-500 nm exposed to a strong optical 
field. We neglect the Coulomb interaction between the electrons, 
which is justified by the extremely short (~1 fs) timescale on which 
the processes occur. The time dependence of the applied electric fields 
is taken from attosecond-streaking experiments (Fig. 3c). We compute 
j_x(t) as the temporal derivative of the field-induced polarization, 
multiplying it by the effective cross-section, A_eff, of the metal-dielectric 
interfaces confining the active volume of the dielectric. More details 
are given in the Supplementary Information. 

Our computations predict the dependence of 

$$Q_P = A_{\mathrm{eff}} \int j_x(t)\,\mathrm{d}t$$

on the CEP (Fig. 2a, black dashed line), the peak electric field (Fig. 2b), 
and the delay between the injection and drive fields for different F_d(t) 
(Fig. 3) and F_i(t) (Supplementary Fig. 7) waveforms. All the observed 
dependencies have been excellently reproduced using parameters 
(defining the band structure and three relevant dipole matrix elements) 
that we chose once and did not subsequently adjust. The 
parameter A_eff was adjusted to fit the magnitude of the collected 
charge, and A_eff ~ 5 × 10⁻¹² m² gave the best agreement with the 
measurements. This value matches the experimentally estimated 
metal-dielectric cross-section within an order of magnitude. Hence, our 
microscopic model predicts our observations both qualitatively and, 
within an order of magnitude, quantitatively. This quantitative 
agreement also implies that a significant fraction of the charge separated by 
the optical currents in the dielectric is transferred to the metal leads, in 
Figure 3 | Subfemtosecond control of electric current with the electric field 
of light. a, b, Transferred charge, Q_P, versus delay, Δt, between the injection 
(F_i(t)) and drive (F_d(t)) fields, with parameters as described in the text. For 
Δt < 0, F_d(t) precedes F_i(t). The CE phases of the injection and drive fields, φ_CE^(i) 
and φ_CE^(d) respectively, are set such that Q_P induced independently by any of the 
two fields vanishes (insets). The experimental data shown were recorded 
under identical conditions except for φ_CE^(d), which is shifted by π between the two 
measurements (compare the insets). The experimental data (circles) represent 
an average of several consecutively acquired values at a given Δt value. The 
error bars are the respective standard deviations. The solid lines correspond to 
the smoothed data obtained by averaging over adjacent points. The dashed lines 
show the prediction of our microscopic quantum mechanical model. Data from 
our model have been multiplied by the effective cross-section, A_eff, of the 
metal-dielectric interfaces confining the active volume of the dielectric (main 
text). The slightly asymmetric polarity of the experimental signal could be due 


line with the predictions of a theoretical study of optically induced 
currents in a single-molecule nanojunction (Supplementary 
Information, section 3.4). 

The calculated j_x(t) in Fig. 3c and Supplementary Fig. 3 reveals 
ultrafast turn-on and turn-off behaviour, allowed by the extremely 
wideband energy spectrum of the dielectric. This predicted temporal 
behaviour enables us to evaluate the peak current density from our 
measurements. With a maximum value of Q_P(Δt) of the order of 
0.1 A fs (Fig. 3a, b) and A_eff as specified above, our experimental data 
yield a transferred charge density of the order of 10¹⁰ A fs m⁻². Given 
that the duration of the current density pulse is of the order of 1 fs (see 
j_x(t) in Fig. 3c and Supplementary Fig. 3), we estimate a peak current 
density perpendicular to the electrodes of J_x ~ 10¹⁰ A m⁻², driven by a 
to the non-zero residual current for each individual pulse. The dependence of 
the current on Δt demonstrates that the charge-carrier generation takes place 
predominantly in the silica medium and that electrons of the gold electrodes, 
which are not sensitive to the component of the field parallel to the 
metal-insulator interface, that is, F_i(t), do not contribute to the optically injected 
current. c, Real-time optical electric field of the visible/near-infrared pulses 
retrieved from attosecond-streaking measurements (solid black line), which 
were performed under identical laser conditions. The magnitude of the field is 
normalized to the strength of the drive pulse, F_d(t), used in the photocurrent 
experiment. The red dashed curve shows the time-dependent current density, 
j_x(t), as calculated from our quantum mechanical model for Δt = 1.2 fs. The 
vertical black dashed lines provide a guide to the eye for comparing the 
temporal evolution of the measured electronic signal with that of the drive 
electric field. 


peak field amplitude of F_0^(d) ~ 0.2 × 10¹⁰ V m⁻¹. We may now 
introduce an effective electrical conductivity at optical frequencies as 
σ_eff(ω_L) = J_x/F_0^(d) ~ 5 Ω⁻¹ m⁻¹, which exceeds the static d.c. 
conductivity of amorphous silica, σ_a-SiO₂ < 10⁻¹⁸ Ω⁻¹ m⁻¹, by more than 18 
orders of magnitude. 
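
The chain of estimates leading to the effective conductivity can be retraced numerically. The following is a rough sketch rather than the authors' analysis: it takes the values quoted in the text (Q_P of about 0.1 A fs, a current pulse of about 1 fs, a drive field of 0.2 × 10¹⁰ V m⁻¹) and assumes A_eff = 5 × 10⁻¹² m² and a d.c. conductivity bound of 10⁻¹⁸ Ω⁻¹ m⁻¹; all variable names are ours.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19

# Values quoted in the text (order-of-magnitude inputs).
q_p_a_fs = 0.1                 # maximum transferred charge, in A fs
a_eff_m2 = 5e-12               # effective cross-section, in m^2 (assumed reading)
pulse_duration_fs = 1.0        # duration of the current density pulse, in fs
drive_field_v_per_m = 0.2e10   # peak drive field, in V/m
sigma_dc_upper = 1e-18         # assumed upper bound on d.c. conductivity, 1/(Ohm m)

# Charge per unit area, then peak current density.
charge_density_a_fs_m2 = q_p_a_fs / a_eff_m2
j_peak_a_m2 = charge_density_a_fs_m2 / pulse_duration_fs

# Effective conductivity: once from the unrounded estimate, once using the
# rounded J ~ 1e10 A/m^2 adopted in the text.
sigma_eff = j_peak_a_m2 / drive_field_v_per_m
sigma_eff_rounded = 1e10 / drive_field_v_per_m

orders_of_magnitude = math.log10(sigma_eff_rounded / sigma_dc_upper)
electrons = q_p_a_fs * 1e-15 / ELEMENTARY_CHARGE_C

print(f"charge density ~ {charge_density_a_fs_m2:.1e} A fs/m^2")
print(f"sigma_eff ~ {sigma_eff:.0f} to {sigma_eff_rounded:.0f} 1/(Ohm m)")
print(f"enhancement over d.c. bound: > {orders_of_magnitude:.1f} orders of magnitude")
print(f"0.1 A fs corresponds to ~{electrons:.0f} electrons")
```

Both routes give a conductivity of a few Ω⁻¹ m⁻¹, so the conclusion of an increase by more than 18 orders of magnitude does not hinge on the rounding.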

Our microscopic theory correctly predicts the field-induced 
increase in polarizability that is responsible for the observed current, 
but does not provide intuitive insight into its origin. We suggest, with 
reference to a detailed discussion in the Supplementary Information, 
that this field-enhanced polarizability can be qualitatively interpreted 
in terms of the dynamic formation of Wannier-Stark states 
localized at a certain site, l, of the lattice along the direction of the applied 
field, F, with an energy of E_l = E_n − eaFl (e is the electron charge, a is 
the lattice constant and E_n is the band offset) in the adiabatic limit 
(Fig. 1b). For high fields, F > 1 V Å⁻¹, the eigenenergies of the 
Wannier-Stark states from the valence band and conduction band 
that reside at neighbouring lattice sites |l_v⟩ and |l_c⟩ become equal; 
see the avoided crossing and anticrossing of the respective purple 
and green plots of energy against field in the inset of Fig. 1b. For 
increasing field strength, adiabatic passage of such an anticrossing via 
Zener-type tunnelling may occur with a significant probability (Fig. 1b inset, 
dashed arrow), emptying a valence band state. The resultant 
unoccupied valence band states give rise to strong visible/near-infrared 
single-photon resonances within the valence band (Fig. 1b inset, red arrow). 
The emergence of these resonances directly implies a strong transient 
polarizability. 

This field-induced transformation is predicted to be reversible and 
highly nonlinear (as suggested by the resultant j_x(t) in Fig. 3c and 
Supplementary Fig. 3). The consequence is a sub-cycle increase in the 
polarizability of the system, which leads to an asymmetric charge 
displacement along the field vector: the averaging of the current to 
zero that is inherent in linear processes is eliminated. Consequently, 
net charge accumulations of opposite sign—dependent on the wave- 
form (Fig. 2) or on the delay between injection and driving fields 
(Fig. 3)—form at the opposite facets of the dielectric and transfer to 
the electrodes. 

Our experiments reveal an ultrafast ‘turning on’ of the measured 
current. At the same time, they do not provide direct evidence for the 
similarly fast 'turning off' predicted by our theory. A proof of the 
ultrafast turn-off behaviour of the underlying field-induced nonlinear 
polarization and conduction band population was provided by recent 
time-resolved absorption and reflection experiments. This suggests 
that conductivity and, consequently, current can be switched on and 
off in a dielectric using optical fields on a timescale of less than, or of 
the order of, 1 fs, and that this can be done without incurring much 
dissipation. This operation cycle is similar to that occurring in a field- 
effect transistor, but is fundamentally more energy efficient: when 
switching, the field-effect transistor dissipates by electron-hole recom- 
bination all energy stored in it, whereas the dielectric in our experi- 
ment returns almost all stored energy to the injection field. These 
possibilities may have ramifications for overcoming the speed limits 
of semiconductor electronics. In the shorter term, our work holds 
promise for the development of a solid-state device for direct sampling 
of electric transients with bandwidths extending to optical frequencies. 


METHODS SUMMARY 

Source of strong ultrashort optical waveforms. A customized titanium-sapphire 
chirped-pulse amplifier produces linearly polarized, visible/near-infrared laser 
pulses with controllable φ_CE, a pulse energy of ~400 μJ and a repetition rate of 
3 kHz. The ultrabroad spectrum of the laser spans the 450-1,100-nm spectral 
range, supporting a sub-4-fs pulse duration (full width at half maximum of the 
intensity temporal profile), which corresponds to a ~1.5-cycle pulse at the carrier 
wavelength of ~750 nm. The laser pulses can be focused onto the investigated 
solid-state system such that the cycle-averaged peak intensity reaches 10¹⁴ W cm⁻² 
(see Supplementary Fig. 2 for properties of the laser). 
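
The laser parameters quoted above are mutually consistent, which a short calculation makes explicit. This is a minimal sketch using standard vacuum relations (optical period λ/c, photon energy hc/λ, and I = ½ε₀cF₀² for a linearly polarized field); the helper names are ours and the 4 fs duration is taken as the upper end of "sub-4-fs".

```python
import math

SPEED_OF_LIGHT_M_S = 299_792_458.0
PLANCK_EV_S = 4.135667696e-15
VACUUM_PERMITTIVITY = 8.8541878128e-12  # F/m


def optical_period_fs(wavelength_nm):
    """Duration of one optical cycle in femtoseconds."""
    return wavelength_nm * 1e-9 / SPEED_OF_LIGHT_M_S * 1e15


def photon_energy_ev(wavelength_nm):
    """Photon energy in electronvolts."""
    return PLANCK_EV_S * SPEED_OF_LIGHT_M_S / (wavelength_nm * 1e-9)


def peak_field_v_per_angstrom(intensity_w_cm2):
    """Peak field amplitude for a cycle-averaged intensity, assuming
    linear polarization in vacuum: I = 0.5 * eps0 * c * F0**2."""
    intensity_w_m2 = intensity_w_cm2 * 1e4
    field_v_per_m = math.sqrt(
        2.0 * intensity_w_m2 / (VACUUM_PERMITTIVITY * SPEED_OF_LIGHT_M_S)
    )
    return field_v_per_m * 1e-10


period_fs = optical_period_fs(750.0)
cycles_in_4_fs = 4.0 / period_fs
energy_ev = photon_energy_ev(750.0)
field = peak_field_v_per_angstrom(1e14)

print(f"period at 750 nm: {period_fs:.2f} fs ({cycles_in_4_fs:.1f} cycles in 4 fs)")
print(f"photon energy: {energy_ev:.2f} eV")
print(f"peak field at 1e14 W/cm^2: {field:.2f} V/angstrom")
```

A 4 fs pulse therefore spans roughly one and a half cycles at 750 nm, and an intensity of 10¹⁴ W cm⁻² corresponds to a field of a few volts per ångström, the regime discussed in the main text.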

Photoactive metal-dielectric nanocircuit. The photoactive circuit on which the 
experiments have been performed consists of a metal-dielectric-metal 
nanojunction. This structure is fabricated by cleaving amorphous SiO₂ (Crystec GmbH) and 
coating the adjacent surfaces of the atomically sharp cleaved silica edge with 
~50 nm of gold evaporated at grazing incidence. We obtain regular, straight, metal 
electrodes isolated from each other by dielectric nanoscale trenches with widths of 
the order of ~50 nm. No voltage bias is applied to the electrodes during the 
experiment. The junction is coated with a supplementary sputtered silica nanofilm such 
that the metal electrodes are embedded in a homogeneous silica matrix. See 
Supplementary Information for more details on the sample structure and 
experimental set-up. Identical measurements were made on samples composed of cleaved 
monocrystalline SiO₂(001) (Crystec GmbH) and on flat, ~500-nm gold-SiO₂-gold 
nanogaps. Both yielded results very similar to those reported in this work, but it 
was possible to distinguish possible effects arising from the lack of long-range order 
in the amorphous insulator and from local plasmonic resonances at the 
metal-dielectric interface. These points are discussed in Supplementary Information. 
Detection scheme for the optically induced φ_CE-dependent electric signal. In 
this work, we present experimental data on the φ_CE-dependent component of the 
optically generated electronic signals in the circuit. We isolate the electric current 
induced by the electric field of the optical waveform from the φ_CE-independent 
contributions (for example, electric currents caused by photo-ionization of gold 
due to imbalanced irradiation of the electrodes). The observable of interest is 
filtered by synchronizing the pulse train from the chirped-pulse amplifier in such 
a way that two consecutive ultrashort light bursts have a φ_CE change of π. This 
results in a modulation of φ_CE at half the repetition rate of the pulse train. The 
optically generated electronic signal in the nanocircuit is measured using a 
current-voltage converter with a variable high gain and a bandwidth supporting the laser 
pulse train. The φ_CE-dependent component of the current is extracted with a lock-in 
amplifier locked at the φ_CE modulation frequency. The presented data correspond 
to experiments performed in ambient atmosphere at ~20 °C. We carried out 
identical measurements in vacuum, yielding the same results. 
Controlling dielectrics with the electric field of light 


Martin Schultze, Elisabeth M. Bothschafter, Annkatrin Sommer, Simon Holzner, Wolfgang Schweinberger, Markus Fiess, 
Michael Hofstetter, Reinhard Kienberger, Vadym Apalkov, Vladislav S. Yakovlev, Mark I. Stockman & Ferenc Krausz 


The control of the electric and optical properties of 
semiconductors with microwave fields forms the basis of modern 
electronics, information processing and optical communications. The 
extension of such control to optical frequencies calls for wideband 
materials such as dielectrics, which require strong electric fields to 
alter their physical properties. Few-cycle laser pulses permit 
damage-free exposure of dielectrics to electric fields of several volts 
per ångström and significant modifications in their electronic 
system. Fields of such strength and temporal confinement can 
turn a dielectric from an insulating state to a conducting state 
within the optical period. However, extending electric signal 
control and processing to light frequencies depends on the feasibility 
of reversing these effects approximately as fast as they can be 
induced. Here we study the underlying electron processes with 
sub-femtosecond solid-state spectroscopy, which reveals the 
feasibility of manipulating the electronic structure and electric 
polarizability of a dielectric reversibly with the electric field of light. 
We irradiate a dielectric (fused silica) with a waveform-controlled 
near-infrared few-cycle light field of several volts per ångström 
and probe changes in extreme-ultraviolet absorptivity and 
near-infrared reflectivity on a timescale of approximately a hundred 
attoseconds to a few femtoseconds. The field-induced changes 
follow, in a highly nonlinear fashion, the turn-on and turn-off 
behaviour of the driving field, in agreement with the predictions of 
a quantum mechanical model. The ultrafast reversibility of the 
effects implies that the physical properties of a dielectric can be 
controlled with the electric field of light, offering the potential for 
petahertz-bandwidth signal manipulation. 

A dielectric subjected to a weak optical field reacts to its change 
instantly (adiabatically) as long as the laser frequency ω_L ≪ Δ_gap/ħ, 
where Δ_gap is the gap between the valence band and conduction band; 
for silica Δ_gap ~ 9 eV (numerical values below are given for this 
material). When the strength of the electric field F approaches the 
critical field strength, at which the electron potential energy changes by 
Δ_gap over the lattice period a ~ 5 Å, 

$$F_{\mathrm{crit}} = \frac{\Delta_{\mathrm{gap}}}{|e|\,a} \qquad (1)$$

(where e is the electron charge), Zener-type transitions inject 
electrons into the entire conduction band within its spectral width 
Δ_c ~ 10 eV. This by itself, irrespective of ω_L, leads to ultrafast dynamics 
of the induced broadband polarization on a timescale of ħ/Δ_c ~ 0.1 fs. 
Hence, real-time access to strong-field-induced dynamics in 
dielectrics calls for sub-femtosecond temporal resolution. In our work, 
this was provided by the envelope of sub-100-attosecond 
extreme-ultraviolet (XUV) pulses and the controlled instantaneous field of 
few-femtosecond near-infrared (NIR) laser pulses. 
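
The two scales introduced in this paragraph follow directly from the quoted material parameters. As a hedged illustration (the function names are ours, and the inputs are the approximate silica values given in the text), the critical field is the bandgap divided by the electron charge times the lattice period, and the polarization response time is ħ divided by the conduction band width:

```python
HBAR_EV_FS = 0.6582119569  # reduced Planck constant in eV fs


def critical_field_v_per_angstrom(gap_ev, lattice_period_angstrom):
    """Field at which the potential energy drop over one lattice period
    equals the bandgap. With the gap in eV, dividing by |e| gives volts,
    so the result is in V per angstrom."""
    return gap_ev / lattice_period_angstrom


def response_time_fs(bandwidth_ev):
    """Timescale hbar / bandwidth of a broadband polarization, in fs."""
    return HBAR_EV_FS / bandwidth_ev


f_crit = critical_field_v_per_angstrom(gap_ev=9.0, lattice_period_angstrom=5.0)
t_response = response_time_fs(bandwidth_ev=10.0)

print(f"critical field ~ {f_crit:.1f} V/angstrom")
print(f"response time ~ {t_response * 1000:.0f} as")
```

With these inputs the critical field comes out just under 2 V Å⁻¹ and the response time below 0.1 fs, which is why sub-femtosecond resolution is needed.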

In a first set of experiments, the waveform-controlled linearly 
polarized field, F_L(t), of NIR laser pulses of less than 4 fs (carrier 
wavelength λ_L = 780 nm and wave cycle T_L = 2π/ω_L = 2.6 fs), along with 
isolated 72-as XUV pulses carried at ħω_XUV ~ 105 eV and polarized 
parallel to the laser field, impinged collinearly on a free-standing 
125-nm SiO₂ film (see Fig. 1a and Methods). The XUV pulse probed 
the processes induced by F_L(t) by promoting electrons from the L-shell 
of silicon into the conduction band states. Transient changes in the 
conduction band are reflected in the XUV spectra recorded as a 
function of delay of the XUV probe with respect to the NIR-field excitation; 
see Fig. 1c. We note that the XUV pulse induces polarization over 
the entire band Δ_c/ħ, which largely dephases within τ_dephasing ~ ħ/Δ_c ~ 100 as 
after passage of the XUV pulse through the sample. The 
XUV light radiated from the sample coherently with the XUV probe 
field therefore decayed within about 100 as after the XUV pulse. In 
contrast with previous experiments in atomic gases, this permitted 
us to record attosecond transient absorption spectra that provide 
information about the evolving state of the quantum system at the 
instant of probing and are not appreciably affected by the ongoing 
strong-field excitation afterwards. 

We supplemented attosecond probing with simultaneous streaking 
of the excitation field F_L(t) (Fig. 1b) in a neon gas jet in front of the SiO₂ 
sample. Figure 1a displays the laser field and the XUV pulse envelope 
retrieved from this measurement. Simultaneous implementation of 
these two attosecond techniques permitted assignment of the transient 
absorption spectra to well-defined moments within the excitation field, 
F_L(t), providing absolute timing of the processes under scrutiny. 

For a detailed analysis, we evaluate the absorbance A, given by 
A(ħω_XUV) = α(ħω_XUV)d, where α(ħω_XUV) is the absorption coefficient 
at the photon energy ħω_XUV, and d is the thickness of the sample. We 
first discuss XUV absorption near the edge of the conduction band. 
The solid line in Fig. 2b shows the transient change in A induced by the 
NIR field plotted in Fig. 2a; its measured peak electric field strength is 
F_0 = 2.5 ± 0.5 V Å⁻¹. It reveals a large (>10%) increase of the sample's 
XUV transmittance over sub-femtosecond intervals. The oscillations 
in absorbance at a frequency of 2ω_L are accompanied by similar 
oscillations in the position of the conduction-band edge (Fig. 2c). The 
measurement resolves condensed-matter processes driven by the 
instantaneous field of visible light. The changes emerge within a single 
half-cycle of the driving field and terminate with similar abruptness 
several femtoseconds later; see also the inset of Fig. 2b. The XUV 
absorption bleaching extends, with similar temporal behaviour, over 
the entire conduction band; see Supplementary Fig. 1. 

To analyse the observed field-induced modifications, we adapted— 
with exactly the same parameters—the quantum-mechanical model 
developed for describing electric currents in dielectrics induced 
by strong, few-cycle NIR fields under physical conditions identical 
to the experiments of this work¹⁴. Briefly, it is based on the time-dependent
Schrödinger equation in the presence of the measured field
F_L(t), incorporating field screening but neglecting electron collisions
and excitonic effects. For a detailed description, see Supplementary 
Information. The model accurately predicts the magnitude and the 
sub-femtosecond temporal structure of the observed modulation in 
strength (Fig. 2b) and position (Fig. 2c) of the 109-eV absorption 
line, as well as the bandwidth and the strength of the XUV absorption
bleaching (Supplementary Fig. 1). Agreement in the phase of these
modulations with respect to the oscillating laser field (Fig. 2b, c)
supports our understanding that it is the instantaneous field that drives
the processes.

Figure 1 | Simultaneous attosecond absorption and streaking spectroscopy.
a, Schematic illustration of the experimental set-up. TOF, time of flight.
b, Attosecond streaking spectrogram of the 72-attosecond (full-width at half
maximum) isolated XUV pulse and a near-single-cycle NIR pulse used in the
experiments. The spectrogram was recorded by measuring the energy spectrum
of photoelectrons released by the XUV pulse into the NIR field as a function
of NIR-XUV delay. c, Attosecond transient absorption spectrogram of a
125-nm-thick SiO₂ membrane in the range 105-112 eV as a function of delay
between the sub-100-attosecond XUV pulse and the NIR laser pulse with a
delay step of Δt_delay = 100 as (the black line shows the synchrotron-XANES
absorption data”* for comparison). d, Schematic energy level diagram of SiO₂.
The blue arrow represents the XUV absorption from Si L-states to the SiO₂
conduction band. The 9-eV bandgap between valence and conduction band
exceeds the NIR photon energy by a factor of 6, as indicated by the red arrow.

In a second set of experiments, we scrutinized the influence of the
strong field on the polarizability in the NIR range by studying the
transmitted and reflected NIR light (for details, see Methods and
Supplementary Information). Within our experimental accuracy of 
about + 1.2% (dictated by the laser pulse-to-pulse stability), the trans- 
mittance of our NIR beam did not vary up to the intensity threshold 
for breakdown. In contrast, the power of the reflected beam, detected 
in a z-scan geometry, increased by nearly an order of magnitude for 
field strengths approaching the breakdown threshold as shown in 
Fig. 3a. To ensure the highest contrast for the subsequent pump-probe
study (see below), this measurement was implemented with
light impinging at Brewster’s angle and polarized in the plane of
incidence (p-polarized). The dependence on the field strength was
extremely nonlinear, closely resembling that of the field-induced
current¹⁴. Our model is not yet applicable to an oblique angle of
incidence but did correctly predict the field-strength-dependence of
the current¹⁴. The similarity in the intensity scaling of the optical
current and reflectivity points to their common physical origin: the
field-enhanced polarizability.

Figure 2 | Attosecond time-resolved strong-field-induced effects in SiO₂.
Solid lines are experimental results; dashed lines are predictions of theoretical
modelling. a, Electric field of the few-cycle NIR laser pulse impinging on the
SiO₂ sample, F_L(t), as extracted from attosecond streaking (see Fig. 1b).
b, Transient change of the absorbance A(ħω_XUV) = α(ħω_XUV)d integrated over
a 1-eV bandwidth at ħω = 109 eV, as a function of the delay (the delay step is
Δt_delay = 100 as) between the 72-as XUV probe and the NIR laser pulse (blue
solid line), along with the prediction of our quantum mechanical model (red
dashed line). The inset shows the evolution of A in a more extended delay range,
recorded with larger delay steps (Δt_delay = 0.5 fs). The error bar in b represents
the standard error of the average over 15 spectral lineouts within the energy
range 108.5-109.5 eV; it exhibits little variation over the delay scan. The dashed
violet line is the calculated local density of states (LDOS) at the position of a Si
atom (integrated over the energy range accessed by the XUV pulse; for more
details, see Supplementary Information) versus delay of the XUV probe.
c, Energy of the absorption peak at 109 eV subject to an optical-field-induced
(dynamic Stark) shift (the blue solid line shows the measurement, the red
dashed line shows the calculation). The error bar in c represents the 95%
confidence interval of the peak position of a least-squares fit of the area of a
Gaussian function to the area under the absorption line; it exhibits little
variation over the delay scan.

Is this effect mainly due to a conduction-band population surviving 
the few-cycle field or to reversible effects being not only turned on but 
also turned off by the field? Although the XUV bleaching already sug- 
gests that the latter is more likely, we sought additional evidence using 
time-resolved optical reflectometry. To this end, we split the incident 
pulse into a strong pump and a weak probe beam, with the sum of the 
two field maxima being slightly below the breakdown threshold, and 
measured the reflected probe intensity as a function of the delay At; see 
the sketch of the apparatus in Fig. 3a and the result in Fig. 3b. 

Figure 3 | Wave-cycle-resolved NIR femtosecond probing of strong-field-induced
nonlinear reflectivity of SiO₂. a, Reflected power of p-polarized NIR
laser pulses of less than 4 fs incident at Brewster’s angle on the thin wedged
sample of fused silica as a function of peak electric field (penetrating into the
sample). The process is fully reversible for several thousand laser shots before
irreversible damage occurs owing to self-focusing inside the sample rather than
on the surface. The data points are averaged over 3,000 laser pulses (error bars
represent the standard deviation). Inset, schematic illustration of the pump-probe
transient reflectivity set-up (for description, see Methods). b, Reflected
power on axis of the probe pulse of less than 4 fs (represented by the red beam;
intensity envelope I_probe) as a function of the delay with respect to the strong
pump pulse of less than 4 fs (illustrated by the violet beam; intensity envelope
I_pump). The curve is the result of an average of ten scans; for representative
individual scans and details, see Supplementary Fig. 2 and the related
description in the Supplementary Information. The reflected probe signal at
delays of Δt = ±T_L is denoted by S(±T_L). c, Electric field of the few-cycle NIR
laser pulse impinging on the SiO₂ sample, F_L(t), as extracted from attosecond
streaking (see green solid line in Fig. 1b), and the resulting nonlinear
polarization P_nl(t), normalized to ε₀F₀, predicted by our model (purple dashed
line). d, Computed transient evolution of the population of (unperturbed)
conduction band states timed to the laser field. e, Peak transient conduction
band population versus peak applied field strength. The grey shading
represents the experimental value of the optical breakdown threshold, as
determined from the reflectance measurement yielding the data shown in a.

Comparing the reflected probe signal S at delays Δt = ±T_L, we
found the signal to be significantly greater in the immediate aftermath
of the strong pulse, S(T_L)/S(−T_L) = 2.1 ± 0.2 (with no measurable
dependence on the carrier-envelope phase to within the measurement
accuracy limited by intensity fluctuations). However, comparing
signals on opposite fringe peaks at Δt ≈ ±2T_L, ±3T_L, ... showed no
appreciable asymmetry, that is, S(nT_L) ≈ S(−nT_L) for n > 1, within our
experimental precision. If a significant fraction of the induced nonlinear
reflectivity (and, consequently, polarization) survived the pulse peak
over several cycles, this would imply S(nT_L) significantly larger than
S(−nT_L), owing to a much larger fraction of the probe pulse experiencing
the increased reflectivity. Our observation thus provides compelling
evidence, in a model-independent manner, for the substantial
reversibility of the strong-field-induced increase in polarizability on the
timescale of the optical wave cycle.

Figure 3c contrasts the applied electric field F_L(t) (solid line) with
the calculated field-induced nonlinear polarization P_nl(t) normalized
to ε₀F₀ (dashed line). The strongly sharpened half cycles of P_nl(t) point
to an extremely nonlinear scaling with driving field strength, consistent
with the data in Fig. 3a. The nonlinear response is largely confined
to the central laser cycle and disappears completely before the end of
the pulse. This effect is accompanied by a field-induced transient
population and subsequent depopulation of the conduction band,
confined, again, to several femtoseconds (see Fig. 3d). For field
strengths beyond F_0 ≈ 2.5 V Å⁻¹, Fig. 3e shows a near-exponential
growth of conduction band population, causing breakdown. In fact,
the maximum field we could safely apply to our samples before breakdown
was F_0 = 2.5 V Å⁻¹. Thus, our model also correctly predicts the
threshold for optical breakdown (Fig. 3e). Both the nonlinear polarization
and the conduction band population induced by the strong
field return to near-zero immediately after the laser pulse for
F_0 < 2.5 V Å⁻¹. This is fully consistent with the abrupt decay of
field-induced transient NIR reflectivity and XUV absorption bleaching:
see Figs 3b and 2b, respectively. These results imply that the
sample exposed to fields as high as F_0 < 2.5 V Å⁻¹ resumes its original
(field-free) state immediately after exposure. This points towards the
reversible, Hamiltonian nature of the laser-induced dynamics up to the
critical field strength.

Our quantum mechanical model identifies light-field-induced reversible
changes of the local density of states (Fig. 2b), the conduction-band
population, and the polarizability as the mechanisms responsible
for transient XUV absorption bleaching, NIR reflectivity enhancement,
optical breakdown and optical-field-induced currents¹⁴. The
model accounts for all these phenomena with a set of parameters
adjusted in ref. 14, which we did not modify in this work. The ability
to predict such a large aggregate of disparate physical effects properly
without parameter adjustment provides convincing evidence of the
theory’s validity.

Considerations presented in ref. 14 provide a hint towards a possible 
intuitive picture to explain the above effects in terms of the well-known 
phenomenon of Wannier-Stark localization’? *'. The field-induced 
strong transient localization of states is expected to occur in all bands, 
allowing XUV-induced transitions predominantly between states 
localized at the same site of the lattice. The energy levels of such 
colocalized Wannier-Stark states of different bands are shifted identi- 
cally by the field, leaving the XUV transition energies (between the 
L-band and conduction band) nearly unchanged. Hence, XUV prob- 
ing is insensitive to these energy shifts, apart from the small dynamic 
shifts observed in Fig. 2c, but uncovers dynamic changes in the density 
of states. The Wannier-Stark ladder formation delivers an intuitive 
account for the computed decrease of the local density of states 
(Fig. 2b), which appears to be mainly responsible for the decrease of 
the XUV absorption. The field-enhanced polarizability of the system 
can also be related to the strong Stark shifts in the conduction band 
and valence band: they pull interband transition frequencies into 
resonance with the visible-NIR spectrum of the driving field¹⁴. The
resultant enhancement in polarizability appears to be the common 
origin of the optical-field-induced reflectivity observed here and the 
currents observed in ref. 14. In spite of the consistency of this argu- 
ment, the intuitive Wannier-Stark picture remains hypothetical until 
directly verified by experiment, for example, by probing valence band- 
conduction band transitions with sub-femtosecond resolution. 

In conclusion, the synergistic use of attosecond streaking and
absorption spectroscopy and wave-cycle-resolved optical reflectivity
measurements provides real-time insight into strong-field phenomena
in dielectrics on sub-femtosecond to few-femtosecond timescales. 
The experiments reveal, in agreement with quantum-mechanical 
modelling, that field-induced changes of all the physical quantities 
studied are reversible up to the critical field strength, that is, they 
can be turned on and off on the timescale of the optical period. 
Controlling the properties of dielectrics with the instantaneous electric 
field of light points the way towards petahertz signal sampling and 
processing technologies. 


METHODS SUMMARY 

XUV transmittivity. Time-resolved XUV absorption spectroscopy was per- 
formed with broadband, isolated attosecond XUV pulses generated via high har- 
monic generation of visible-infrared laser pulses of less than 4 fs (FemtoPower 
Compact Pro, Femtolasers) in neon. The collinear laser and XUV beam were 
separated into two arms of a Mach-Zehnder-type interferometer. The relative 
timing between the two arms could be adjusted with a mirror moved by a piezo 
translator, and a variable aperture controlled the infrared intensity on target. For
details, see the Supplementary Information and ref. 22. The XUV and NIR beams
were focused with a toroidal mirror on free-standing chemical-vapour-deposited
SiO₂ samples (either 125 nm or 250 nm thick). The XUV beam transmitted
through the sample was spectrally dispersed by a flat-field grating and projected
on an XUV-sensitized camera’’. Our measurements were performed near (to within
<10%) optical breakdown. NIR-field-induced breakdown manifested itself as 
macroscopic mechanical damage of the sample, which became immediately visible 
in the magnified image of the focal region, which was permanently monitored. 
NIR transmittivity and reflectivity. Field-induced changes in NIR transmittivity 
through 125-nm films and the reflectivity of thin (~0.2 mm) wedged plates were 
studied with gently focused laser pulses of less than 4 fs; the intensity on the
sample was varied by translating it longitudinally along the optical axis (z-scan).
The pump-probe measurement is sketched schematically in Fig. 3a. The incident 
NIR beam was split into two collinear beams by a two-component plane mirror 
consisting of a 5-mm-diameter inner section and a concentric external section. 
The inner and outer sections reflected the weak probe and the strong pump beam 
with a reflectivity of 10% and 97.5%, respectively. The energy of the pump pulse 
was controlled by an aperture. The probe pulse was delayed by translating the
inner section with a piezo stage and focused along with the pump pulse onto the
sample. The probe beam was isolated with an iris in front of the detector measuring
the reflected probe pulse energy as a function of pump-probe delay. A more
detailed description of the Methods is given in the Supplementary Information.

78 | NATURE | VOL 493 | 3 JANUARY 2013

Probabilistic cost estimates for climate change mitigation

For more than a decade, the target of keeping global warming below
2°C has been a key focus of the international climate debate’. In 
response, the scientific community has published a number of 
scenario studies that estimate the costs of achieving such a target” >. 
Producing these estimates remains a challenge, particularly because 
of relatively well known, but poorly quantified, uncertainties, 
and owing to limited integration of scientific knowledge across 
disciplines®’. The integrated assessment community, on the one 
hand, has extensively assessed the influence of technological and 
socio-economic uncertainties on low-carbon scenarios and asso- 
ciated costs” *’. The climate modelling community, on the other 
hand, has spent years improving its understanding of the geo- 
physical response of the Earth system to emissions of greenhouse 
gases* '*, This geophysical response remains a key uncertainty in the 
cost of mitigation scenarios but has been integrated with assess- 
ments of other uncertainties in only a rudimentary manner, that 
is, for equilibrium conditions®'*. Here we bridge this gap between 
the two research communities by generating distributions of the 
costs associated with limiting transient global temperature increase 
to below specific values, taking into account uncertainties in four 
factors: geophysical, technological, social and political. We find that 
political choices that delay mitigation have the largest effect on the 
cost-risk distribution, followed by geophysical uncertainties, social 
factors influencing future energy demand and, lastly, technological 
uncertainties surrounding the availability of greenhouse gas miti- 
gation options. Our information on temperature risk and mitiga- 
tion costs provides crucial information for policy-making, because 
it clarifies the relative importance of mitigation costs, energy 
demand and the timing of global action in reducing the risk of 
exceeding a global temperature increase of 2 °C, or other limits such 
as 3 °C or 1.5 °C, across a wide range of scenarios. 

We generate cost distributions by combining mitigation cost esti- 
mates of emissions scenarios with probabilistic temperature projec- 
tions. Importantly, our cost estimates do not account for any avoided 
climate damages as a result of emission reductions. This information is 
obtained from a large set of scenarios created with an integrated assess- 
ment model'*"’, for which the temperature increase is computed 
with a probabilistic climate model'*”” (Fig. 1, Supplementary Fig. 1, 
Methods and Supplementary Information). Each modelling frame- 
work has inherent limitations. For example, although it incorporates 
state-of-the-art uncertainty quantifications of the Earth system, our 
model does not fully explore tipping points. Similarly our energy- 
economic emissions scenarios map a wide range of possible futures 
(Supplementary Figs 7 and 8) but are not exhaustive of all potential 
outcomes (Supplementary Information). 
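
As a rough illustration of how a probabilistic temperature projection can be reduced to a single risk number for one emissions scenario, the sketch below counts the fraction of ensemble members that stay below a temperature limit. The function name and the ensemble values are hypothetical and chosen only for illustration; they are not outputs of the models used here.

```python
from typing import Iterable


def probability_below(peak_warming_c: Iterable[float], limit_c: float) -> float:
    """Fraction of ensemble members whose peak warming stays below limit_c."""
    samples = list(peak_warming_c)
    if not samples:
        raise ValueError("at least one temperature sample is required")
    return sum(1 for t in samples if t < limit_c) / len(samples)


# Hypothetical peak-warming ensemble (degrees C) for a single scenario.
hypothetical_peaks_c = [1.4, 1.7, 1.9, 2.0, 2.1, 2.3, 2.6, 3.1]

for limit in (2.0, 2.5, 3.0):
    print(limit, probability_below(hypothetical_peaks_c, limit))
```

Repeating this for scenarios with increasing carbon prices would give one (cost, probability) point per scenario, which is the kind of relationship summarized in Fig. 1c.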

Temperature projections for any given pathway have a spread 
owing to geophysical uncertainties’* (Fig. 1b). In the absence of any 
serious mitigation efforts (present global carbon prices of less than 
US$1 per tonne of carbon-dioxide-equivalent emissions (tCO₂e⁻¹)),
the likelihood of limiting warming to less than 2 °C is essentially
zero (<1%; Fig. 1c). However, imposing a carbon price of about
US$20 tCO₂e⁻¹ in our model would increase the probability of
staying below 2 °C to about 50%, and carbon prices of more than
US$40 tCO₂e⁻¹ would achieve the 2 °C objective with a probability
of more than 66% (‘likely’ by the definition of the Intergovernmental
Panel on Climate Change’). Similar trends hold for other cost metrics 
(Supplementary Information). For example, a carbon price of
US$20-40 tCO₂e⁻¹ translates in our model to cumulative discounted mitigation
costs (2012-2100) of the order of 0.8-1.3% of gross world product
(Supplementary Fig. 10). 
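
The relationship between carbon price and probability quoted above can be read as a cumulative curve. The following sketch, which assumes simple linear interpolation between tabulated points, looks up the lowest price that reaches a target probability. The first three points approximate the values quoted in the text; the plateau point at US$1,000 and the interpolation scheme are assumptions made for illustration only.

```python
from typing import List, Optional, Tuple


def price_for_probability(
    curve: List[Tuple[float, float]], target: float
) -> Optional[float]:
    """Lowest carbon price reaching `target` probability, or None if never reached.

    `curve` holds (price, probability) pairs sorted by price, with
    non-decreasing probability. Values between points are linearly interpolated.
    """
    if curve and curve[0][1] >= target:
        return curve[0][0]
    for (p0, q0), (p1, q1) in zip(curve, curve[1:]):
        if q0 < target <= q1:
            return p0 + (target - q0) * (p1 - p0) / (q1 - q0)
    return None


# (price in US$ per tCO2e, probability of staying below 2 degrees C).
# The last point is a hypothetical plateau added for illustration.
illustrative_curve = [(1.0, 0.0), (20.0, 0.50), (40.0, 0.66), (1000.0, 0.70)]

print(price_for_probability(illustrative_curve, 0.66))
print(price_for_probability(illustrative_curve, 0.90))
```

A return value of None represents a probability level that stays out of reach however high the price, which is how a levelling-off curve would show up in such a lookup.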

A marked feature of the mitigation cost distribution (Fig. 2) is that 
the probability of global warming staying below 2 °C levels off at high 
carbon prices. This occurs because, beyond a given carbon price, nearly 
all mitigation options that can substantially influence emissions in the 
medium term have been deployed in our model. Higher carbon prices 
help further to reduce emissions later in the century, but only affect 
temperatures after peaking”’. Hence, the probability of staying below 
2°C during the twenty-first century reaches an asymptote. 

Geophysical uncertainties shed light on only one aspect of mitiga- 
tion costs, however. To gain insight into how assumptions regarding 
technological and social uncertainties influence our cost distribution, 
we create a large set of sensitivity cases (Table 1), in which we vary 
some salient features of the scenarios, namely the availability and use 
of specific mitigation technologies; future social development and, by 
extension, global energy demand; and the international political con- 
text surrounding climate mitigation action, specifically delays in the 
implementation of a globally comprehensive mitigation response’ 
(Supplementary Information). We note that population and economic 
growth do not vary in our scenarios; we therefore cannot assess their 
relative importance with our ensemble (Supplementary Information). 
Given its policy relevance”', we focus most of our discussion on the 
limit of 2 °C (Supplementary Figs 4 and 5 illustrate the results for 2.5 
and 3 °C, respectively). 

Our results can be framed in two ways (Fig. 2): first, in terms of how 
probabilities for achieving the 2°C objective change for a fixed cost 
(black arrows); and, second, in terms of how the cost consistent with 
the 2°C goal varies for a given probability level (orange arrows). 
Whether or not a carbon price of about US$40 tCO₂e⁻¹ restricts global
warming to less than 2 °C with a likelihood of more than 66% depends
on the future availability of key mitigation technologies (Fig. 2a). In our
worst-case technology-sensitivity assumption, namely that capture and
geological storage of carbon (CCS) is entirely unavailable, the probability
of staying below 2 °C at a carbon price of US$40 tCO₂e⁻¹ decreases to
around 50%. However, with no such constraints and further technolo- 
gical breakthroughs (Table 1), the likelihood of limiting warming to 
2°C could be higher than 66% at the same carbon price. 
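
The two framings described above amount to reading a cost-risk curve vertically (probability at a fixed price) or horizontally (price at a fixed probability). The sketch below shows both readings for two curves using linear interpolation. The curve values are hypothetical apart from the roughly 66% and 50% probabilities at US$40 tCO₂e⁻¹ quoted in the text, and the helper name `interp` is introduced here for illustration.

```python
from typing import Optional, Sequence


def interp(x: float, xs: Sequence[float], ys: Sequence[float]) -> Optional[float]:
    """Piecewise-linear interpolation; returns None outside the tabulated range."""
    for x0, x1, y0, y1 in zip(xs, xs[1:], ys, ys[1:]):
        if x0 <= x <= x1:
            if x1 == x0:
                return y0
            return y0 + (x - x0) * (y1 - y0) / (x1 - x0)
    return None


# Carbon prices (US$ per tCO2e) and probabilities of staying below 2 degrees C.
prices = [1.0, 20.0, 40.0, 1000.0]
reference = [0.0, 0.50, 0.66, 0.70]  # partly hypothetical
no_ccs = [0.0, 0.38, 0.50, 0.60]     # hypothetical except 0.50 at US$40

# Framing 1: change in probability at a fixed price.
shift = interp(40.0, prices, no_ccs) - interp(40.0, prices, reference)
print("probability change at US$40:", shift)

# Framing 2: price needed for a fixed probability (None means out of reach).
print("reference price for 66%:", interp(0.66, reference, prices))
print("no-CCS price for 66%:", interp(0.66, no_ccs, prices))
```

In this hypothetical example the no-CCS curve never reaches 66%, so the horizontal reading returns None rather than a price.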

The cost distributions also show how changes in technological measures
affect the economics of mitigation given a fixed probability level.
For example, in most cases the 2 °C objective can be achieved with a
probability of more than 66% as long as the carbon price is high
enough (Fig. 2a). In certain instances, however, the lack of mitigation
options (such as renewable technologies, nuclear power or limited
biomass and afforestation potential) could require substantially higher
carbon prices to keep this target viable. At the limit is CCS: the complete
elimination of this mitigation option, either for technological
reasons or as a result of social and political concerns, would put the
2 °C objective (with more than 66% probability) out of reach in our
model, no matter how high the carbon price.

The future availability of energy supply technologies (for example
renewable technologies and CCS) tells only one side of the story; a
strong finding from our analysis is that social developments influencing
energy demand (that is, efficiency of energy use) are even more
important. This is evidenced in Fig. 2b by the differences between three
distinct scenario families whose future energy demands vary greatly
(low, intermediate and high; see Table 1 and ref. 7 for details). In the
low-demand scenarios, end-use efficiency measures and conservation-minded
energy and urban planning policies are instituted ubiquitously
throughout the industrial, building and transportation sectors in all
countries. This leads to a global energy demand in 2050 that is about
25% lower than our intermediate baseline, which broadly applies
historical patterns of efficiency improvement’. Such reductions in
demand could be crucial in keeping the 2 °C objective within reach,
independently of what happens in terms of energy supply.

For example, in our scenarios the availability of nuclear power has
an almost negligible effect on overall mitigation costs compared with a
switch from a scenario with an intermediate energy demand to one
with a high demand. Low-demand strategies would ensure a higher
likelihood of staying below 2 °C for the same carbon price (from 66%
to more than 80% at US$40 tCO₂e⁻¹), or, viewed in a different way,
would drastically reduce the cost of reaching the ‘likely’ probability
level (from US$40 tCO₂e⁻¹ to around US$10-15 tCO₂e⁻¹). In contrast,
a high-energy-demand future (about 20% greater in 2050 than
the intermediate baseline, resulting from more energy-intensive lifestyles
and less efficiency- and conservation-focused policies) would
require much higher carbon prices (>US$150 tCO₂e⁻¹) and make it
much more difficult, if not impossible, to reach the 2 °C objective with
a probability of more than 66%.

Figure 1 | Methodology for creating cost-risk relationships for a given
temperature limit. a, Illustrative set of emissions scenarios with carbon prices
increasing from zero to US$1,000 tCO₂e⁻¹. The arrow at right indicates the
direction of increasing carbon prices across the illustrative set. The
blue-highlighted scenario has a global carbon price discounted back to 2012 of
US$21 tCO₂e⁻¹. b, Probabilistic temperature projections for the blue trajectory
in a. Horizontal lines at 2, 2.5 and 3 °C show possible target temperature limits.
In this illustrative scenario, median (50% probability) warming is 2.0 °C. There
is a slim chance (<5%) that temperatures remain below 1.3 °C and a large
chance (>90%) that they remain below 3.0 °C. c, Cumulative distributions of
carbon prices consistent with limiting warming to below 2, 2.5 and 3 °C, as
indicated. Blue dots indicate points defined by the cost information of the
scenario highlighted in a and the probabilistic temperature projection in b.

Overall, Fig. 2b indicates that the present influence of geophysical 
uncertainties on the spread in mitigation costs to achieve the 2 °C 
objective is comparable to that of the uncertainties arising from different 
future pathways for social development and technological changes and 
choices. The maximum difference in probability of staying below 2 °C 
between the least costly (blue-dashed) and the most costly (red-dotted) 
distribution is slightly greater than 60 percentage points. This roughly 
matches the range of probabilities found when taking into account 
the Earth system uncertainty under the same supply and demand 
assumptions (for example 0-70% for the entire range of carbon prices 
in our central case, which assumes a reference technology portfolio and 
intermediate energy demand). Such a finding is broadly consistent with 
earlier studies comparing the relative contributions of geophysical and 
technological factors” using a non-probabilistic approach. 

Yet, despite all of the uncertainty in the geophysical, social and 
technological aspects, our analysis indicates that the dominant factor 
affecting the likelihood and costs of achieving the 2 °C objective is 
politics. Here we model political uncertainties by varying the timing 
of concerted global mitigation efforts. Although studies of the implica- 
tion of delays in climate action are not new*”’*°, our results show how 
geophysical uncertainties interact and compare with political inertia: if 
global temperature rise is to be kept below 2 °C with a probability of 
more than 66% under central technology and energy demand assump- 
tions, our scenarios show that immediate and globally coordinated 
mitigation action is necessary (Fig. 2c; Supplementary Information 
provides an explanation of ‘immediate’). Only for low-energy-demand 
pathways can global mitigation action be delayed until 2020 and the 
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Figure 2 | Influence of mitigation technology, energy demand and political 
inaction on the cost-risk distributions for staying below 2 °C. Cost 
distributions for six cases with varying future availability of specific mitigation 
technologies (a) and three sensitivity cases for future energy demand (b, thick 
solid lines). Shaded areas and dashed lines in b represent technology-sensitivity 
cases comparable to those shown in a. Shaded areas and dashed lines in 


2 °C objective still be achieved with a probability of more than 66% (or 
delayed until 2030 with a probability of 50%; Fig. 2d). 

In conclusion, we find that the effect of global mitigation action 
delayed by two decades is much more pronounced than the conse- 
quences of uncertainty surrounding mitigation technology availability 
and future energy demands, and renders even the geophysical uncer- 
tainties almost irrelevant for the 2 °C objective (Fig. 2d, Supplementary 


Table 1 | Overview of sensitivity cases 
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d represent technology- and politics-sensitivity cases comparable to those in 
band c, respectively. c, Impact of delayed global mitigation action. d, Overview 
figure combining all sensitivity cases. The horizontal line in a—c is the 66% line. 
Similar figures for 2.5 and 3 °C are provided in Supplementary Figs 4 and 5. 
A comparison with 91 scenarios from the literature’ is provided in 
Supplementary Fig. 7. 


Table 2 and Supplementary Fig. 9). Furthermore, we find asymptotic 
limits to increasing the probability of reaching a given temperature 
objective in our model: if mitigation action is delayed, simply spending 
more money on the problem in the future will not increase this proba- 
bility beyond certain limits imposed by the Earth system. 

Our mitigation cost distribution methodology can also be applied to 
other temperature objectives, for example a weaker limit (3 °C) or a 


Mitigation technology 


Technological limits 
No new nuclear 


From 2020 onwards, no new investments are made into nuclear power, leading to a full phase-out of existing plants by 2060. 


Limited land-based measures The mitigation potential from biomass, land use and forestry is limited. 


No CCS 

globally meaningful scale. 

Technological breakthroughs 
Advanced transportation 


Technology to capture and geologically store CO2 (CCS) from fossil fuel and/or biomass energy never becomes available on a 


Fundamental changes in transportation infrastructures (for example for electric transport) or major breakthroughs in transportation 
technology (for example in hydrogen fuel cells) lead to increased decarbonization of the transportation sector. 


Advanced non-COz mitigation The mitigation potential of non-CO2 greenhouse gases is assumed to improve continuously, beyond the level of current best practice. 


Energy demand 


Intermediate demand The development of energy demand and efficiency improvements is broadly consistent with (only slightly faster than) what is 


° 


bserved historically. 
High demand 
Low demand 


Energy efficiency improves more slowly than historically observed, leading to a high future energy demand. 
Energy efficiency improves radically in all end-use sectors (buildings, industry, transport) leading to low future energy demand. 


Political inaction 


Delayed action 


Globally concerted mitigation action is postponed from today until 2015, 2020, 2025 and 2030 in respective cases. 


Detailed descriptions and background are provided in Supplementary Information and ref. 7. 
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Figure 3 | Cost-risk distributions for returning global temperature increase 
to below 1.5 °C by 2100. a, Cost distributions for three cases of varying future 
energy demand (solid red, green, and blue lines) and varying future availability 
of specific mitigation technologies (shaded ranges around solid lines). Ranges 


stricter limit (1.5 °C), the second of which has already been discussed
in the policy arena. We find that unless energy demand is low, CCS
technology is available and global climate action is undertaken immediately,
holding temperature increase to below 1.5 °C by 2100 with a
probability of at least 50% is already unfeasible (Fig. 3a). In terms of
costs, this would require the immediate introduction of global carbon
prices of more than US$40 per tCO2e (increasing over time with the
discount rate). If global mitigation action were delayed by 10 to 20
years, a carbon price of US$40 per tCO2e would yield probabilities of
only 10-35%; and, even under higher prices, a 50% probability could
no longer be reached under central technology and low-energy-demand
assumptions (Fig. 3b). However, the same carbon price,
US$40 per tCO2e, would prevent an increase in warming beyond 3 °C
with a high probability (>90%) for all supply-demand combinations,
contingent on the immediate global introduction of the pricing instrument
(Supplementary Fig. 5).

Our findings have implications for the ongoing international climate
policy discussions, which foresee a global agreement coming
into effect only in 2020. For this delay strategy to be successful, national 
and local governments would need to place far greater importance 
on concurrent demand-side solutions to climate protection (thus 
lowering energy demand growth), as well as on voluntary or revised 
near-term mitigation policies and measures that anticipate and are 
consistent with a future stringent climate agreement. Our model 
results show that robustly safeguarding the future achievement of 
the oft-discussed 2 °C objective requires that society embarks on a 
higher-efficiency, lower-energy-demand course well before 2020 in 
the context of sustained, concerted and coordinated mitigation efforts. 


METHODS SUMMARY 


We create a large ensemble (n > 700) of emissions scenarios with MESSAGE, a
global integrated assessment modelling framework with a detailed representation
of greenhouse-gas-emitting sectors, by imposing cumulative constraints on
greenhouse gas emissions (for all such gases: carbon dioxide, methane, nitrous oxide,
halocarbons and fluorinated gases) of varying stringencies for the whole
twenty-first century, and by changing salient features in the underlying scenario
assumptions (see Supplementary Information, Supplementary Fig. 1, Supplementary Table 1
and ref. 7 for a full set of assumptions). Our scenarios adopt ‘middle-of-the-road’
assumptions for socio-economic development from previous research on scenarios:
population peaking at 9.7 billion later this century (United Nations median
projection) and gross world product increasing more than sevenfold by 2100
(updated Special Report on Emissions Scenarios B2 scenario projection by the
Intergovernmental Panel on Climate Change).

We then compute probabilistic estimates of global temperature increase for 
each scenario with the MAGICC climate model. These estimates are based
on a 600-member ensemble of temperature projections for each scenario, which 
together closely represent the carbon-cycle and climate uncertainties as assessed 
show the variation over all assessed technology-sensitivity cases. b, Influence of
global mitigation action delayed from now until 2030. The vertical axis shows 
the probability of returning global average temperature increase to below 1.5°C 
by 2100. 


in the Fourth Assessment Report of the Intergovernmental Panel on Climate
Change. Additionally, our temperature projections are constrained by
observations and estimates of hemispheric temperatures and ocean heat uptake
(Supplementary Information). The probability of staying below a given
temperature threshold is computed over the entire twenty-first century and relative to
pre-industrial levels. In contrast to the 2 °C objective, the target of 1.5 °C is referred
to as a long-term goal, meaning that we allow a small, temporary overshoot and
assess the probability of returning warming to below 1.5 °C by 2100.
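
The two probability definitions above can be illustrated with a minimal sketch. It assumes, as a simplification not stated in the text, that each probability is the plain fraction of ensemble members meeting the criterion; the function names and the toy ensemble are hypothetical and are not MAGICC output.

```python
def prob_below_whole_century(ensemble, threshold):
    """Fraction of members whose warming stays below the threshold in every year."""
    hits = sum(1 for path in ensemble if max(path) < threshold)
    return hits / len(ensemble)


def prob_return_below(ensemble, threshold):
    """Fraction of members below the threshold in the final year (overshoot allowed)."""
    hits = sum(1 for path in ensemble if path[-1] < threshold)
    return hits / len(ensemble)


# Toy ensemble (hypothetical values): warming in degrees C at 2050, 2075 and 2100.
toy_ensemble = [
    [1.4, 1.6, 1.4],
    [1.3, 1.4, 1.3],
    [1.6, 1.9, 1.7],
    [1.5, 2.1, 1.9],
]

print(prob_below_whole_century(toy_ensemble, 2.0))  # 0.75
print(prob_return_below(toy_ensemble, 1.5))         # 0.5
```

The first function corresponds to the 2 °C style of objective (no exceedance at any time), the second to the 1.5 °C long-term goal, for which only the end-of-century value matters.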

We present our results using carbon prices as the cost metric. For an illustration of 
our results using other cost metrics, such as total mitigation costs, see section 1.4 of 
Supplementary Information and Supplementary Figs 2 and 3. The carbon price shown 
is the price at the time action starts, discounted back to 2012 with a discount rate of 
5% per year (Supplementary Information, section 1.4, and Supplementary Fig. 6). 
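
As a rough illustration of the cost metric, the sketch below converts between a carbon price at the time action starts and its value discounted back to 2012. Annual compounding at 5% is an assumption made here for simplicity; the function names are hypothetical.

```python
def discount_to_base_year(price_at_start, start_year, base_year=2012, rate=0.05):
    """Discount a price in start_year back to base_year (annual compounding assumed)."""
    return price_at_start / (1.0 + rate) ** (start_year - base_year)


def price_in_year(base_price, year, base_year=2012, rate=0.05):
    """Price in a later year when it rises at the discount rate from base_year."""
    return base_price * (1.0 + rate) ** (year - base_year)


# A price of US$40 per tCO2e in 2012 terms, expressed at a start of action in 2030.
nominal_2030 = price_in_year(40.0, 2030)
print(round(nominal_2030, 2))
print(round(discount_to_base_year(nominal_2030, 2030), 2))  # back to 40.0
```

Under this convention, a price that increases over time with the discount rate has the same discounted value in every year.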
The oxidation state of the mantle and the extraction
of carbon from Earth’s interior 


Vincenzo Stagno, Dickson O. Ojwang, Catherine A. McCammon & Daniel J. Frost


Determining the oxygen fugacity of Earth’s silicate mantle is of 
prime importance because it affects the speciation and mobility 
of volatile elements in the interior and has controlled the character 
of degassing species from the Earth since the planet’s formation.
Oxygen fugacities recorded by garnet-bearing peridotite xenoliths 
from Archaean lithosphere are of particular interest, because they 
provide constraints on the nature of volatile-bearing metasomatic 
fluids and melts active in the oldest mantle samples, including 
those in which diamonds are found. Here we report the results
of experiments to test garnet oxythermobarometry equilibria
under high-pressure conditions relevant to the deepest mantle 
xenoliths. We present a formulation for the most successful equi- 
librium and use it to determine an accurate picture of the oxygen 
fugacity through cratonic lithosphere. The oxygen fugacity of the 
deepest rocks is found to be at least one order of magnitude more 
oxidized than previously estimated. At depths where diamonds can 
form, the oxygen fugacity is not compatible with the stability of 
either carbonate- or methane-rich liquid but is instead compatible 
with a metasomatic liquid poor in carbonate and dominated by 
either water or silicate melt. The equilibrium also indicates that the 
relative oxygen fugacity of garnet-bearing rocks will increase with 
decreasing depth during adiabatic decompression. This implies that 
carbon in the asthenospheric mantle will be hosted as graphite or 
diamond but will be oxidized to produce carbonate melt through the 
reduction of Fe3+ in silicate minerals during upwelling. The depth of
carbonate melt formation will depend on the ratio of Fe3+ to total
iron in the bulk rock. This ‘redox melting’ relationship has import- 
ant implications for the onset of geophysically detectable incipient 
melting and for the extraction of carbon dioxide from the mantle 
through decompressive melting. 

Oxythermobarometry measurements indicate that the oxygen
fugacity (fO2) recorded by spinel-bearing peridotite rocks, which originate
at a depth of 30-50 km in the mantle, ranges from approximately
3 log units below to 2 log units above the fayalite-magnetite-quartz
oxygen buffer (that is, the log(fO2) value determined relative to the
buffering equilibrium 3Fe2SiO4 + O2 = 2Fe3O4 + 3SiO2; we denote
this value ΔFMQ). Abyssal peridotites, thought to be residues of
melting to produce mid-ocean-ridge basalt (MORB), lie at the low
end of this range, and xenoliths associated with subduction zones are
in general at the higher end. Over this range of fO2, CO2 and H2O are
the major volatile species, consistent with their dominance in erupting
basaltic magmas.

For garnet-bearing rocks, which originate at depths between 50 and
220 km, the only commonly employed oxythermobarometry equilibrium
uses the Fe3Fe2Si3O12 (skiagite) garnet component:

$$2\,\mathrm{Fe^{2+}_{3}Fe^{3+}_{2}Si_{3}O_{12}}\ (\text{garnet}) = 4\,\mathrm{Fe_{2}SiO_{4}}\ (\text{olivine}) + 2\,\mathrm{FeSiO_{3}}\ (\text{orthopyroxene}) + \mathrm{O_{2}} \qquad (1)$$

This equilibrium, which requires the precise determination of the ratio
of Fe3+ to total Fe (Fe3+/ΣFe) in garnet, has been extensively used to
determine the redox state of xenoliths mainly from cratonic lithosphere,
but has only been experimentally tested for its accuracy in
estimating fO2 under conditions of 3 and 3.5 GPa and 1,300 °C,
equivalent to ~100-km depth. The conclusion from the use of equilibrium (1)
has been that the fO2 value of the Archaean lithosphere
decreases with depth until conditions are close to those at which
Fe-Ni metallic alloy would start to precipitate from mantle minerals
(5 log units below the FMQ buffer). Under such conditions, volatile
species in the mantle would be dominated by CH4 and H2 rather than
by H2O and CO2 (refs 8-10).

To test oxythermobarometry equilibria, we performed multi-anvil
experiments at high pressure and temperature on garnet peridotite
assemblages and determined the Fe3+/ΣFe ratio of garnet at fO2 values
that we measured using an Ir-Fe alloy redox sensor. Experimental
capsules comprised layers of garnet powder (either natural or formed
from an initial synthetic glass) sandwiched between layers containing a
mixture of olivine, orthopyroxene and garnet. In most experiments,
the oxygen fugacity was buffered by a mixture of graphite and carbonate
mixed within the sample, although several experiments contained
H2O rather than carbonate and one experiment involved no carbon.
Approximately 3% Ir was added to all layers of the experiments. This
alloys with Fe from the silicates during the experiment, allowing fO2 to
be determined from the equilibrium

$$\mathrm{Fe_{2}SiO_{4}}\ (\text{olivine}) + \mathrm{Mg_{2}SiO_{4}}\ (\text{olivine}) = 2\,\mathrm{MgSiO_{3}}\ (\text{orthopyroxene}) + 2\,\mathrm{Fe}\ (\text{alloy}) + \mathrm{O_{2}} \qquad (2)$$



The Fe content of the Ir-Fe fO2 sensor shifts to equilibrate with fO2,
which can be measured by chemical analysis of the silicate and alloy
phases. The ferric Fe contents of the garnet layers in the run products
were determined by Mössbauer spectroscopy. Experiments were conducted
at pressures of 3, 6 and 7 GPa (corresponding to depths
of 90-250 km) and temperatures of 1,300-1,600 °C for periods of
12-24 h.

The resulting garnet Fe3+ contents (~5-18% of total Fe) are similar
to values found in garnets from mantle xenoliths. As can be seen in
Fig. 1, oxygen fugacities calculated using equilibrium (1) are in good
agreement with those determined from the Ir-Fe sensor equilibrium (2)
at 3 GPa but deviate strongly by 1-2 log units at 6 and 7 GPa. The origin
of this discrepancy at higher pressure is unclear, although we note that
constraints on thermodynamic data for the Fe3Fe2Si3O12 garnet component
skiagite are sparse (Methods). A further oxythermobarometer
equilibrium

$$2\,\mathrm{Ca_{3}Fe_{2}Si_{3}O_{12}}\ (\text{garnet}) + 2\,\mathrm{Mg_{3}Al_{2}Si_{3}O_{12}}\ (\text{garnet}) + 4\,\mathrm{FeSiO_{3}}\ (\text{orthopyroxene}) = 2\,\mathrm{Ca_{3}Al_{2}Si_{3}O_{12}}\ (\text{garnet}) + 4\,\mathrm{Fe_{2}SiO_{4}}\ (\text{olivine}) + 6\,\mathrm{MgSiO_{3}}\ (\text{orthopyroxene}) + \mathrm{O_{2}} \qquad (3)$$

was proposed in a previous study and has been successfully tested at
3 GPa and 1,300 °C (ref. 7). The equilibrium is based on the
ferric-Fe-bearing garnet component andradite (Ca3Fe2Si3O12), for which
thermodynamic data are well constrained, and although thermodynamic
data for the Fe3Fe2Si3O12 garnet component are still required
for treatment of the reciprocal reactions, these are likely to be less
sensitive to errors in pressure and temperature dependence
(Methods). As shown in Fig. 1, oxygen fugacities calculated from the
experimental data using equilibrium (3) are in much better agreement
with those determined from the Ir-Fe sensor over the entire pressure
range when compared with the results for equilibrium (1).

In Fig. 2, oxygen fugacities calculated using equilibrium (3) for
xenoliths from the Kaapvaal, Slave and Siberian cratons are shown
as functions of equilibration depth. The deepest determined
peridotite oxygen fugacities are at least 1 log unit more oxidized than
previous estimates and extend to only slightly more-reducing conditions
than do abyssal peridotites that are considered to be MORB
melting residues. The most reduced samples (below about ΔFMQ −3) correlate
well with indicators for extreme melt depletion such as high Cr contents
and high olivine Mg/(Fe+Mg) ratio, and more oxidized samples probably
result from subsequent metasomatism by melt or fluid phases. In
the diamond stability field, the range of xenolith oxygen fugacities is
quite narrow and lies between ΔFMQ −2 and ΔFMQ −3.5. This range
is below that defined by the reaction

$$\mathrm{MgSiO_{3}}\ (\text{enstatite}) + \mathrm{MgCO_{3}}\ (\text{magnesite}) = \mathrm{Mg_{2}SiO_{4}}\ (\text{olivine}) + \mathrm{C}\ (\text{diamond}) + \mathrm{O_{2}} \qquad (4)$$

which marks the upper fO2 stability limit of diamond with respect to the
solid carbonate mineral magnesite and is denoted EMOD. Because
temperatures in the lithosphere can be above the carbonated peridotite
solidus, an equilibrium equivalent to equilibrium (4) determines the
stability of diamond with respect to pure carbonate melt (Fig. 2,
dashed curve; MgCO3 melt). Further curves show the stability of diluted
carbonate melts with respect to diamond in terms of the molar percentage
of the carbonate (CO3 2−) component. The majority of xenoliths
from the diamond stability field have oxygen fugacities that would not
be in equilibrium with pure carbonate melts but only with melt phases
in which the carbonate component (or CO2 component in a fluid) is
relatively dilute, that is, 1-10%. For example, the diluting component
could be H2O or, if temperatures were sufficiently high, silicate melt.
The latter would be more consistent with low mineral H2O contents
recently measured for such samples, implying metasomatism at
temperatures substantially higher than the present average mantle
temperature. However, the range of fO2 is above that where fluid or melt
phases would contain a considerable CH4 component. The results,
therefore, rule out CH4- and MgCO3-rich fluids or melts being major
pervasive metasomatic agents in the Archaean lithosphere.

Equilibrium (3) will also exert some control over the oxygen fugacity
of adiabatically upwelling mantle, which can influence the inception
of decompression melting beneath mid-ocean ridges. In Fig. 3, the
oxygen fugacity of a garnet peridotite with a bulk silicate Earth
composition is calculated as a function of the bulk rock Fe3+/ΣFe ratio
using equilibrium (3) (Methods Summary). Experimental partitioning
data, and data on the Fe3+ distribution in xenoliths, are used for this
calculation (Methods). As pressure decreases, the oxygen fugacity for a
given bulk rock Fe3+/ΣFe ratio will increase, causing a packet of
mantle to auto-oxidize at a rate of ~1/60 log units per kilometre.
Figure 3 also shows a curve indicating the stability limit of diamond
and graphite with respect to carbonate melt, determined along an adiabat.
At oxygen fugacities below this curve diamond and graphite are
stable, whereas above it carbonate melts can form. This curve dips at low
pressure owing to the onset of the main phase of silicate melting.
Estimates for average mantle Fe3+/ΣFe content are of the order of
2%. However, this may be a minimum estimate because it is based on
analyses of lithospheric mantle samples; Fig. 3 indicates that Fe3+
contents of 3-4% of total Fe yield oxygen fugacities closer to those
determined for the MORB source from basalt glass analyses. At
depths of 200 km, carbon in mantle containing less than 4% Fe3+
would be in the diamond stability field and only when decompressed
to 150 km would the oxygen fugacity be compatible with the oxidation
of graphite to form carbonate melt through the reduction of Fe3+ in


[Figure 1, panels a and b. Horizontal axis: measured log(fO2) (Ir-Fe or Pt-Fe sensor), −9.0 to −3.0. Key: 3-3.5 GPa (ref. 7); 3 GPa, 6 GPa and 7 GPa (this study).]


Figure 1 | Comparison between oxythermobarometers. a, Oxygen
fugacities calculated using the Ir-Fe redox sensor, equilibrium (2), for
garnet-peridotite assemblages equilibrated in this study are compared with
those calculated using equilibrium (1) and the model from ref. 7. b, The fO2
estimates from the Ir-Fe redox sensor compared with those calculated using the
model proposed in this study for equilibrium (3). For comparison we show data
at 3 and 3.5 GPa from ref. 7, where fO2 was calculated using a similar Pt-Fe redox
sensor. Filled symbols denote reversal experiments, which used synthetic and
natural garnets with initial Fe3+/ΣFe ratios that were greater than those
measured in the recovered garnet samples, whereas open symbols indicate
lower initial garnet Fe3+/ΣFe ratios. The arrows indicate the directions from
which the corresponding data approach equilibrium. The oxygen fugacities calculated
using equilibrium (1) deviate considerably from the alloy redox sensor
values at pressures of 6-7 GPa, whereas those calculated with equilibrium (3)
are in good agreement with the alloy sensor over the entire pressure range.
Uncertainties (1σ) in the Ir-Fe and Pt-Fe log(fO2) measurements are 0.3 log
units, and those estimated for equilibria (1) and (3) are 0.5 and 0.7 log units,
respectively (Methods).
Figure 2 | Log(fO2) (normalized to the FMQ buffer) calculated for xenoliths
from the cratonic lithosphere using equilibrium (3). In the key, black
symbols indicate mantle xenoliths from the Kaapvaal craton (up-triangles, ref.
2; squares, ref. 3; circles, ref. 14; diamonds, ref. 16; down-triangles, ref. 17), open
symbols represent the Slave craton (squares, ref. 15; diamond, ref. 18) and grey
squares represent the Siberian craton. The oxygen fugacity of the Fe-Ni
precipitation curve indicates conditions under which nickel-rich iron metal will
start to precipitate from coexisting silicates. The curve labelled EMOG/D is
the oxygen fugacity calculated for equilibrium (4) along a cratonic geotherm. It
defines the stability field between diamond (or graphite) and magnesite
(MgCO3) in the mantle. At depths >150 km, this curve dips owing to carbonate
melting. The curves plotted below EMOG/D indicate oxygen fugacities for
melts in equilibrium with diamond or graphite, for different carbonate (CO3 2−)
contents (molar percentage of MCO3 is indicated for each curve, where M is a
divalent cation). The majority of cratonic xenoliths would coexist with melts
characterized by dilute carbonate contents (1-10%). The light-grey region
indicates the diamond stability field. The dark-grey rectangle indicates the
range of oxygen fugacities exhibited by abyssal peridotites.


silicates. The uncertainty in the depth of initial carbon oxidation is
~50 km once the errors in the determinations of fO2 are taken into
account. The partitioning of water from mineral phases into the melt
may increase this depth by a similar magnitude. The reduction of Fe3+
would force the oxygen fugacity to remain buffered along the equilibrium
curve between graphite and carbonate melt until all graphite in
the rock had been oxidized to carbonate. The oxidation of 30-p.p.m.
graphite, which is a typical carbon content estimate of the MORB
source, would occur over a depth interval of approximately
30 km and only then would auto-oxidation continue to increase the
oxygen fugacity.

Typical estimated MORB source carbon contents are sufficient to
reduce the bulk Fe3+ content of the mantle by 1% of total Fe during
decompression, implying that the mantle at depth is richer in Fe3+ by
an amount proportional to the ultimate CO2 content of the MORB
source. This has several consequences. First, geophysical studies have
been used to argue for the presence of incipient carbonate melting at
depths >200 km beneath mid-ocean ridges. If this were the case
then oxidation of 30-p.p.m. carbon to carbonate at this depth would
require the bulk mantle Fe3+ content to be >7 ± 2% of total Fe, which
is higher than previous estimates. Second, although estimates for
MORB mantle C contents are 30 p.p.m., estimates for some enriched
sources (based, for example, on CO2 contents of ocean island
basalts) have C contents in the range 300-1,300 p.p.m. (ref. 21).
Several studies have indicated that the formation of majoritic garnet
additionally contributes to a decrease in mantle fO2 such that the mantle
probably contains Fe-Ni-rich metallic alloy at depths >300 km (refs 9,
10, 30). In the transition zone, and the lower mantle in particular,
Figure 3 | Speciation of carbon in adiabatically upwelling mantle.
a, Estimates for the oxygen fugacity of a bulk silicate Earth mantle, determined
for different bulk rock Fe3+/ΣFe ratios calculated along a 1,320 °C adiabat are
indicated by the parallel grey curves. These curves are determined from
equation (6) using experimental and natural mineral partitioning data.
The bold black curve indicates the oxygen fugacity of an assemblage buffered by
graphite (or diamond) coexisting with carbonate melt and dips at low
pressure owing to the onset of silicate melting. The green rectangle indicates the
range of fO2 estimated for the MORB source. Mantle containing 4% Fe3+/ΣFe
would be in the diamond stability field at depths >170 km. At shallower depths,
transformation to graphite would occur at point [1]. At 150 ± 50 km, the
oxygen fugacity of upwelling mantle would intersect the equilibrium between
graphite and carbonate melt (point [2]), which would initiate carbonate
melting through the reduction of Fe3+ in silicate phases. This ‘redox melting’
process would occur over a depth interval of ~30 km, between points [2] and
[3], over which 30 p.p.m. of carbon in the mantle source would be oxidized by
the reduction of the bulk rock Fe3+/ΣFe ratio by 1%. For carbon contents
>30 p.p.m. in the mantle source, a greater initial Fe3+/ΣFe ratio is required for
the oxygen fugacity to remain in the range estimated for the MORB source,
which is remarkably uniform. At point [4], the carbonate melt will evolve
towards a silicate melt composition. b, The path followed by the blue arrows in
a is demonstrated with the same points indicated. Carbonate melts formed at
depth as a result of redox melting are potentially focused into the source region
of basalts (dashed diagonal lines). Redox melting to produce carbonate melt
occurs at a depth ~150 km shallower than the carbonated peridotite solidus.


the stability of minerals with substantial Fe3+ contents at low oxygen
fugacity would ensure that even large mantle CO2 contents would
reduce to form diamond, CH4, Fe-Ni carbide or C-bearing metallic
alloy. To convert even this lowest estimate of 300-p.p.m. C into CO2
on decompression would require the ocean island basalt source at
depth to have >10% of total Fe as Fe3+.
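
The redox bookkeeping behind these figures can be checked with a short sketch. It assumes that each carbon atom oxidized from C(0) to C(4+) consumes four Fe3+, that carbon contents are by weight, and that the bulk mantle holds about 8.05 wt% total iron expressed as FeO; the last value is an assumption introduced here, not a number from the text, and the function name is hypothetical.

```python
M_C = 12.011    # molar mass of carbon, g/mol
M_FEO = 71.844  # molar mass of FeO, g/mol


def fe3_reduction_percent(carbon_ppm, feo_total_wt_percent=8.05):
    """Percentage of total Fe that must be reduced (Fe3+ to Fe2+) to oxidize the carbon."""
    mol_c = carbon_ppm * 1e-6 / M_C                       # mol C per gram of rock
    mol_fe3_needed = 4.0 * mol_c                          # four electrons per C atom
    mol_fe_total = feo_total_wt_percent / 100.0 / M_FEO   # mol Fe per gram of rock
    return 100.0 * mol_fe3_needed / mol_fe_total


for ppm in (30, 300):
    print(ppm, "p.p.m. C ->", round(fe3_reduction_percent(ppm), 1), "% of total Fe")
```

Under these assumptions, 30 p.p.m. C corresponds to roughly 1% of total Fe and 300 p.p.m. C to roughly 9%, in line with the orders of magnitude quoted above. The source Fe3+ content must exceed the amount consumed, which is one possible reading of the >10% figure rather than something the text spells out.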

This large Fe3+ content might imply the existence of either an Fe3+-rich
primordial mantle source or the involvement of recycled oceanic
lithosphere richer in both C and Fe3+, possibly as a result of the
pressure-induced reduction of carbonates. However, C contents of
ocean island basalt sources might be overestimated as a result of
carbonate-melt focusing. The inception of deep carbonate melting during
upwelling, due to depression of the carbonated-peridotite solidus, can
potentially focus carbonate-rich melts extracted from much larger
volumes of the mantle into the smaller source volume from which
the bulk of basaltic magma is extracted. Deep carbonate-rich
melting, however, requires higher initial mantle Fe3+ contents, which
couples the extraction of carbon from the mantle to the oxygen fugacity.
High CO2 contents of basaltic magmas, therefore, do not necessarily
imply high carbon source concentrations but instead plausibly
imply more-oxidized sources. A more-reduced mantle in Earth’s early
history would therefore have aided the preservation of the mantle’s
primordial C content. In addition, periods of upwelling of
more-oxidized mantle, due to recycling or upwelling from a mainly isolated
but more-oxidized lower mantle, may have extracted large amounts of
CO2 from the mantle in discrete episodes, such as those producing
large igneous provinces.


METHODS SUMMARY 


Experiments were performed using several starting materials to cover suitable
ranges of fO2 and to demonstrate the approach to equilibrium of garnet
compositions from both initially reduced and oxidized (a reversal in terms of
garnet oxidation state) garnet starting materials. Multi-anvil experimental
capsules contained layers of natural garnet or synthetic glass representative of
fertile garnet compositions (Supplementary Table 1) sandwiched between
layers of a mineral mixture comprising orthopyroxene, garnet and olivine. In
almost all the experiments, starting materials were placed inside graphite
sleeves within a rhenium outer capsule. A flux of either carbonate or H2O was
added to all experiments to improve equilibration (Supplementary Table 2). The
coexistence of graphite and carbonate melt in many experiments also buffered
the oxygen fugacity (Supplementary Fig. 1). One carbon-free experiment was
performed in a Re metal capsule. Iridium metal powder (3 wt%) was added to all
experiments, and alloyed with Fe to act as a redox sensor. Chemical
compositions of the mineral phases were obtained using an electron microprobe. The
equilibrium fO2 in each experiment was calculated from the redox sensor using
the expression

$$\log f_{\mathrm{O_2}} = -\frac{\Delta G^{\circ}_{(2)}}{\ln(10)RT} - 2\log a^{\mathrm{opx}}_{\mathrm{MgSiO_3}} - 2\log a^{\mathrm{alloy}}_{\mathrm{Fe}} + \log a^{\mathrm{ol}}_{\mathrm{Fe_2SiO_4}} + \log a^{\mathrm{ol}}_{\mathrm{Mg_2SiO_4}} \qquad (5)$$

where T is the temperature, R is the gas constant, $\Delta G^{\circ}_{(2)}$ is the standard-state Gibbs
free-energy change of equilibrium (2), $a^{\mathrm{alloy}}_{\mathrm{Fe}}$ is the activity of Fe in the Ir-Fe alloy
and $a^{\mathrm{ol}}_{\mathrm{Fe_2SiO_4}}$ is the activity of the Fe2SiO4 component in olivine (and the other
activities are defined similarly). The resulting Fe3+/ΣFe ratio in garnets from
high-pressure experiments was measured using Mössbauer spectroscopy. Samples
were double-polished to a thickness of 200-300 µm. Spectra were collected at
298 K using a nominal 370-MBq 57Co point source (Supplementary Table 3 and
Supplementary Fig. 2).
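The oxygen fugacity for equilibrium (3) was calculated from

$$\log f_{\mathrm{O_2}} = -\frac{\Delta G^{\circ}_{(3)}}{\ln(10)RT} + 2\log a^{\mathrm{gt}}_{\mathrm{Ca_3Fe_2Si_3O_{12}}} + 2\log a^{\mathrm{gt}}_{\mathrm{Mg_3Al_2Si_3O_{12}}} + 4\log a^{\mathrm{opx}}_{\mathrm{FeSiO_3}} - 2\log a^{\mathrm{gt}}_{\mathrm{Ca_3Al_2Si_3O_{12}}} - 4\log a^{\mathrm{ol}}_{\mathrm{Fe_2SiO_4}} - 6\log a^{\mathrm{opx}}_{\mathrm{MgSiO_3}} \qquad (6)$$

where $\Delta G^{\circ}_{(3)}$ is the standard-state Gibbs free-energy change of equilibrium (3).
Activity-composition relations, thermodynamic data for equilibria (1)-(3) and
calculated oxygen fugacities are reported in Supplementary Tables 4 and 5.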
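
Equations (5) and (6) follow directly from the equilibrium constants of reactions (2) and (3), with product activities subtracted and reactant activities added, each weighted by its stoichiometric coefficient. The sketch below evaluates both expressions for user-supplied inputs. The function and argument names are hypothetical, and the free energies and activities must come from the thermodynamic data and activity models described in Methods and the Supplementary Tables; no values from those tables are reproduced here.

```python
import math

R = 8.314462618  # gas constant, J mol^-1 K^-1


def log_k(delta_g, temperature):
    """Return -dG/(ln(10) R T) for dG in J/mol and T in kelvin."""
    return -delta_g / (math.log(10.0) * R * temperature)


def log_fo2_sensor(delta_g2, temperature, a_fe_alloy, a_mgsio3_opx,
                   a_mg2sio4_ol, a_fe2sio4_ol):
    """Equation (5): log fO2 from the Ir-Fe alloy redox sensor, equilibrium (2)."""
    return (log_k(delta_g2, temperature)
            - 2.0 * math.log10(a_mgsio3_opx)
            - 2.0 * math.log10(a_fe_alloy)
            + math.log10(a_mg2sio4_ol)
            + math.log10(a_fe2sio4_ol))


def log_fo2_garnet(delta_g3, temperature, a_andradite, a_pyrope,
                   a_fesio3_opx, a_grossular, a_fe2sio4_ol, a_mgsio3_opx):
    """Equation (6): log fO2 from the garnet equilibrium (3)."""
    return (log_k(delta_g3, temperature)
            + 2.0 * math.log10(a_andradite)
            + 2.0 * math.log10(a_pyrope)
            + 4.0 * math.log10(a_fesio3_opx)
            - 2.0 * math.log10(a_grossular)
            - 4.0 * math.log10(a_fe2sio4_ol)
            - 6.0 * math.log10(a_mgsio3_opx))


# Sign check with placeholder inputs (dG set to zero, unit activities elsewhere):
# a tenfold drop in andradite activity lowers log fO2 by 2 units.
print(log_fo2_garnet(0.0, 1573.15, 0.1, 1.0, 1.0, 1.0, 1.0, 1.0))
```

The placeholder call only checks the direction of each term: a lower ferric (andradite) component in garnet gives a lower calculated fO2, and a lower Fe activity in the alloy gives a higher one.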
Full Methods and any associated references are available in the online version of
the paper. 
METHODS


Starting materials and capsules. Starting materials for multi-anvil experiments
consisted of layers of either natural garnet or garnet formed from synthetic glass
(Supplementary Table 1) sandwiched between layers of natural olivine and
orthopyroxene from San Carlos mixed with the same garnet composition.
Compositions of starting garnets are given in Supplementary Table 1. Glasses were
made from oxides melted and quenched at 1,600 °C. Two glasses (a and b in
Supplementary Table 1) were then reduced at 800 °C in a H2-CO2 furnace at an
oxygen fugacity equivalent to 2 log units less than the FMQ oxygen buffer. One of
these glasses was then crystallized into two garnet samples in successive
experiments performed at 3 GPa and 1,200 °C in a piston cylinder press using Pt capsules
with added H2O. Two resulting garnet compositions were more oxidized than the
starting glass and had Fe3+/ΣFe ratios of 0.05 and 0.1, respectively (e and f in
Supplementary Table 1). A further glass composition (d in Supplementary Table
1) was not reduced and therefore preserved a high Fe2O3 content. In some
experiments, 5% of a sub-equal mixture of orthopyroxene, olivine and carbonate was
mixed within the garnet layer. In subsequent experiments using more-oxidized
garnet glass compositions, only graphite was added to the garnet layer (these
differences are indicated in Supplementary Table 2). All experiments except one
(V721) were performed inside graphite sleeves and had 10% graphite powder
mixed within the sandwiching olivine, orthopyroxene and garnet layers. Iridium
metal (3 wt%) was added to all starting compositions to act as a redox sensor. In
addition, a flux was added to all experiments. This was MgCO3, (Mg,Ca)CO3
(formed by adding a 1:1 mixture of MgCO3 and CaSiO3, with the latter also
resulting in the formation of diopside) or H2O added as brucite. The graphite
sleeves were enclosed in Re-foil outer capsules. Experiment V721 was graphite
free and was encapsulated in a Re sleeve inside a Pt capsule.

Multi-anvil experiments. Multi-anvil experiments were performed using
tungsten carbide cubes with a corner-truncation edge length of 11 mm, in Cr2O3-doped
MgO octahedral pressure media with an edge length of 18 mm. LaCrO3 or graphite
heaters were used, placed inside an outer ZrO2 sleeve. Experimental capsules were
2 mm in diameter and 3.5 mm in length, and were placed at the centre of the
cylindrical heater inside MgO spacers. The temperature was monitored with a
W97Re3-W75Re25 (D-type) thermocouple inserted within an alumina sleeve, with
the junction in contact with the top of the capsule. Further details can be found
in ref. 22.

Analytical techniques. All the recovered run products were mounted in epoxy
resin and then ground and polished using ethanol to preserve the carbonate phases
(solid and melt) for analytical investigations. The chemical compositions of liquid
and mineral phases were obtained using a Jeol JXA-8200 electron microprobe
equipped with five wavelength-dispersive spectrometers. The Fe3+/ΣFe ratio in
garnets recovered from high-pressure experiments was measured using
Mössbauer spectroscopy. Samples were double-polished to a thickness of
approximately 200-300 µm to give an absorber thickness of about 5 mg Fe cm−2 and to
avoid saturation effects. Spectra were collected at 298 K using a nominal 370-MBq
57Co point source. A velocity range of −5 to +5 mm s−1 was always used. The
velocity scale was calibrated relative to a 25-µm-thick α-Fe foil. Finally, spectra
were folded and fitted to Lorentzian line shapes using the fitting program
MOSSA. Spectra were fitted using one doublet for ferrous iron in the
dodecahedral site, one doublet for ferric iron in the octahedral site and one doublet
(where needed) for Fe2+ in olivine and/or orthopyroxene, using conventional
constraints. The ferric iron content in garnets was calculated from the relative
areas corrected for differences in recoil-free fraction and, together with hyperfine
parameters, are reported in Supplementary Tables 2 and 3.

Thermodynamic data and activity models. The oxygen fugacity in each experiment
was determined from equilibrium (2) using the Ir-Fe redox sensor''”??°
and equation (5). Thermodynamic data to calculate ΔG° are reported in ref. 22.
The oxygen fugacity determined for the garnet peridotite oxythermobarometer
equilibrium (3) is calculated using equation (6). We determined ΔG° from the
data of ref. 13, and fitted it to

$$\frac{\Delta G^{\circ}_{(3)}}{\ln(10)\,RT} = -8.6623 + \frac{21{,}655}{T} + \frac{(13.6149 - 0.01833\,T)\,P}{T}$$

with temperature in kelvin and pressure in kilobars.

Activity-composition relations for components in both equations are described 
below, with all interaction terms reported in Supplementary Table 4. 

The activities of Fe₂SiO₄ and Mg₂SiO₄ components in olivine are described by

$$a^{\mathrm{ol}}_{\mathrm{Mg_2SiO_4}} = (X_{\mathrm{Mg}}\,\gamma_{\mathrm{Mg}})^2$$

$$a^{\mathrm{ol}}_{\mathrm{Fe_2SiO_4}} = (X_{\mathrm{Fe}}\,\gamma_{\mathrm{Fe}})^2$$

where $X_{\mathrm{Fe}}$ = Fe/(Fe+Mg) and the activity coefficients $\gamma_m$ (m = Fe, Mg) are
described using

$$RT\ln(\gamma_m) = W^{\mathrm{ol}}_{\mathrm{FeMg}}\,(1 - X_m)^2$$

where $W^{\mathrm{ol}}_{\mathrm{FeMg}}$ is a symmetric Margules interaction parameter’.
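The symmetric Margules treatment of olivine described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name is hypothetical, and the interaction parameter passed in the usage note is a placeholder, because the fitted value is reported in Supplementary Table 4 and not in this section.

```python
import math

R = 8.314462618  # gas constant, J mol^-1 K^-1


def olivine_activities(x_fe, temperature_k, w_fe_mg):
    """Return (a_Fe2SiO4, a_Mg2SiO4) for a symmetric Margules olivine.

    x_fe          -- Fe/(Fe+Mg) in olivine
    temperature_k -- temperature in kelvin
    w_fe_mg       -- symmetric interaction parameter in J/mol
                     (caller-supplied; not a value taken from the paper)
    """
    x_mg = 1.0 - x_fe
    # RT ln(gamma_m) = W * (1 - X_m)^2 for m = Fe, Mg
    gamma_fe = math.exp(w_fe_mg * (1.0 - x_fe) ** 2 / (R * temperature_k))
    gamma_mg = math.exp(w_fe_mg * (1.0 - x_mg) ** 2 / (R * temperature_k))
    # Activities are the squared product of mole fraction and coefficient
    return (x_fe * gamma_fe) ** 2, (x_mg * gamma_mg) ** 2


# Placeholder inputs: X_Fe = 0.1, 1,673 K, W = 3,000 J/mol (hypothetical)
a_fe, a_mg = olivine_activities(0.1, 1673.0, 3000.0)
print(f"a(Fe2SiO4) = {a_fe:.4f}, a(Mg2SiO4) = {a_mg:.4f}")
```

With the interaction parameter set to zero the function reduces to ideal two-site mixing, that is, the activity equals the squared mole fraction.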


The activities of FeSiO₃ and MgSiO₃ in orthopyroxene are assumed to be ideal,
for example

$$a_{\mathrm{FeSiO_3}} = (X_{\mathrm{Fe}})$$

and the activity of Fe in Ir-Fe alloy is described by the expression

$$a^{\mathrm{alloy}}_{\mathrm{Fe}} = (X_{\mathrm{Fe}}\,\gamma_{\mathrm{Fe}})$$

where

$$RT\ln(\gamma^{\mathrm{alloy}}_{\mathrm{Fe}})$$

For garnets, the activity of each component is described by

The mole fraction of Mg on the dodecahedral site of garnet is, for example,
Mg/(Mg+Ca+Fe²⁺), and the mole fraction of Al on the octahedral site of garnet is
Al/(Al+Fe³⁺+Cr). Activity coefficients are calculated using ternary Margules
activity-composition expressions”, with symmetric’ and asymmetric” terms used for
octahedral and dodecahedral sites, respectively. Non-ideality due to multisite
mixing in garnet, that is, $(\gamma_{\mathrm{Ca_3Al_2Si_3O_{12}}})_{\mathrm{reciprocal}}$, is treated with respect to the
following reciprocal reactions”:

$$\mathrm{Mg_3Al_2Si_3O_{12} + Ca_3Fe_2Si_3O_{12} = Mg_3Fe_2Si_3O_{12} + Ca_3Al_2Si_3O_{12}} \quad (7)$$

$$\mathrm{Fe_3Al_2Si_3O_{12} + Ca_3Fe_2Si_3O_{12} = Fe_3Fe_2Si_3O_{12} + Ca_3Al_2Si_3O_{12}} \quad (8)$$

$$\mathrm{Mg_3Al_2Si_3O_{12} + Ca_3Cr_2Si_3O_{12} = Mg_3Cr_2Si_3O_{12} + Ca_3Al_2Si_3O_{12}} \quad (9)$$

$$\mathrm{Fe_3Al_2Si_3O_{12} + Ca_3Cr_2Si_3O_{12} = Fe_3Cr_2Si_3O_{12} + Ca_3Al_2Si_3O_{12}} \quad (10)$$

The standard-state Gibbs free-energy changes of these reactions (as indicated by
subscripts) are determined to be $\Delta G^{\circ}_{(7)} = \Delta G^{\circ}_{(8)}$ = 33,000 − 70P J (where P is
expressed in kilobars; ref. 12) and $\Delta G^{\circ}_{(9)} = \Delta G^{\circ}_{(10)}$ = 50,000 J (ref. 7). The
corresponding activity coefficients arising from these reciprocal expressions are’

$$RT\ln(\gamma_{\mathrm{Ca_3Al_2Si_3O_{12}}})_{\mathrm{reciprocal}} = -X_{\mathrm{Mg}}X_{\mathrm{Fe^{3+}}}\Delta G^{\circ}_{(7)} - X_{\mathrm{Fe}}X_{\mathrm{Fe^{3+}}}\Delta G^{\circ}_{(8)} - X_{\mathrm{Mg}}X_{\mathrm{Cr}}\Delta G^{\circ}_{(9)} - X_{\mathrm{Fe}}X_{\mathrm{Cr}}\Delta G^{\circ}_{(10)}$$

Uncertainties for the alloy sensor are ~0.3 log units, taking into account
uncertainties in pressure and temperature from experimental runs (0.5 GPa and 50 °C),
from the chemical composition of the run products and from thermodynamic data”.

The contribution of the reciprocal reactions to the activity coefficient terms is
the main source of uncertainty in the use of equilibrium (3) to determine $f_{\mathrm{O_2}}$. In
particular, there are large uncertainties in the magnitudes of $\Delta G^{\circ}_{(7)}$ and $\Delta G^{\circ}_{(8)}$,
which here are assumed to be equal and constrained by a previous study’’.
Taking into account the errors in the previous determinations of these values,
we arrive at an uncertainty of 10 kJ mol⁻¹ for $\Delta G^{\circ}_{(7)}$ and $\Delta G^{\circ}_{(8)}$, which propagates
to an error of approximately 0.5 log units in $f_{\mathrm{O_2}}$. Combining this error with that
typical for garnet ferric Fe determinations results in an uncertainty of 0.7 log units.
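
How an error in the free energies of reactions (7) and (8) enters the reciprocal activity-coefficient term can be sketched in Python. The function evaluates the reciprocal expression of this section, RT ln(γ) = −X_Mg X_Fe3+ ΔG°(7) − X_Fe X_Fe3+ ΔG°(8) − X_Mg X_Cr ΔG°(9) − X_Fe X_Cr ΔG°(10), with ΔG°(7) = ΔG°(8) = 33,000 − 70P J and ΔG°(9) = ΔG°(10) = 50,000 J. The function names and the garnet site fractions in the usage lines are hypothetical and chosen only for illustration; converting the shift into log units of oxygen fugacity requires equation (6) and is not attempted here.

```python
DELTA_G_9_10 = 50000.0  # J, value quoted for reactions (9) and (10)


def delta_g_7_8(pressure_kbar):
    """Free-energy change (J) quoted for reactions (7) and (8)."""
    return 33000.0 - 70.0 * pressure_kbar


def rt_ln_gamma_reciprocal(x_mg, x_fe, x_fe3, x_cr, pressure_kbar):
    """Reciprocal contribution RT ln(gamma), in J/mol.

    x_mg, x_fe  -- mole fractions on the dodecahedral site
    x_fe3, x_cr -- mole fractions of Fe3+ and Cr on the octahedral site
    """
    dg78 = delta_g_7_8(pressure_kbar)
    return -(
        x_mg * x_fe3 * dg78
        + x_fe * x_fe3 * dg78
        + x_mg * x_cr * DELTA_G_9_10
        + x_fe * x_cr * DELTA_G_9_10
    )


def shift_for_error(x_mg, x_fe, x_fe3, error_j=10000.0):
    """Change in RT ln(gamma) if dG(7) and dG(8) both move by error_j."""
    return -(x_mg + x_fe) * x_fe3 * error_j


# Hypothetical garnet composition at 60 kbar
base = rt_ln_gamma_reciprocal(0.7, 0.2, 0.05, 0.05, 60.0)
shift = shift_for_error(0.7, 0.2, 0.05)
print(f"RT ln(gamma) = {base:.0f} J/mol; shift for +10 kJ/mol = {shift:.0f} J/mol")
```

Because the term is linear in the free energies, the shift scales with the product of the ferric iron fraction and the summed Mg and Fe fractions, so ferric-rich garnets are the most sensitive to this uncertainty.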

Origin of the discrepancy between $f_{\mathrm{O_2}}$ determinations using equilibria (1) and
(3). As shown in Fig. 1, oxygen fugacities determined using equilibrium (1) are
lower than those calculated with equilibrium (3) and the Ir-Fe redox sensor at
pressures of 6-7 GPa. As stated, we note that thermodynamic data for the
Fe₃Fe³⁺₂Si₃O₁₂ garnet component skiagite are sparse and are determined from a
previous study that examined the equilibrium

$$\underset{\text{garnet}}{\mathrm{Fe_3Fe^{3+}_2Si_3O_{12}}} + \underset{\text{spinel}}{\mathrm{FeAl_2O_4}} = \underset{\text{garnet}}{\mathrm{Fe_3Al_2Si_3O_{12}}} + \underset{\text{spinel}}{\mathrm{Fe_3O_4}}$$

and determined ΔG° for this exchange equilibrium at 1,300 K (ref. 12). Using
ancillary thermodynamic data, it was then possible to estimate $\Delta_f G^{\circ}$, the Gibbs
free energy of formation, of the Fe₃Fe³⁺₂Si₃O₁₂ garnet component at 1,373 K.
However, no data exist with which to determine the entropy for this component,
which was therefore estimated in ref. 7. As a result, large uncertainties must be
associated with the Gibbs free-energy change for equilibrium (1) at temperatures
other than 1,373 K. Uncertainties in this estimate might at least partly explain
the difference in calculated oxygen fugacities for equilibria (1) and (3) at high
pressures. Although the reciprocal reactions for equilibrium (3) also require
thermodynamic data for Fe₃Fe³⁺₂Si₃O₁₂ and Mg₃Fe³⁺₂Si₃O₁₂, which are assumed to
have identical values, here uncertainties in the entropy are less important as they
probably cancel out on either side of the equilibrium, enabling the estimated $\Delta_f G^{\circ}$
for Fe₃Fe³⁺₂Si₃O₁₂ to provide a sufficient approximation for the Gibbs free-energy
changes of the reciprocal reactions.

Calculation of the oxygen fugacity along a mantle adiabat. The calculation of
mantle $f_{\mathrm{O_2}}$ for a bulk silicate Earth composition as a function of bulk rock
Fe³⁺/ΣFe ratio (Fig. 3) is made using equation (6). Mineral compositions for a
four-phase assemblage are determined by performing a mass balance with Al contents
of orthopyroxene and clinopyroxene determined from experimental and natural
data*"**° and using experimental Fe-Mg partitioning data**”*. The distribution
of Fe³⁺ between garnet, clinopyroxene and orthopyroxene is determined from
models derived from analyses of natural xenoliths'*. No consideration of the
majorite component in garnet was made because the proportion should be small
at the depths considered in the calculation. Similarly, the orthopyroxene to
clinoenstatite (the high-pressure monoclinic polymorph with space group C2/c)
transition was ignored.


Ediacaran life on land


Gregory J. Retallack¹


Ediacaran (635-542 million years ago) fossils have been regarded as 
early animal ancestors of the Cambrian evolutionary explosion of 
marine invertebrate phyla’, as giant marine protists” and as lichen- 
ized fungi’. Recent documentation of palaeosols in the Ediacara 
Member of the Rawnsley Quartzite of South Australia* confirms 
past interpretations of lagoonal-aeolian deposition based on synse- 
dimentary ferruginization and loessic texture*®. Further evid- 
ence for palaeosols comes from non-marine facies, dilation cracks, 
soil nodules, sand crystals, stable isotopic data and mass balance 
geochemistry*. Here I show that the uppermost surfaces of the 
palaeosols have a variety of fossils in growth position, including 
Charniodiscus, Dickinsonia, Hallidaya, Parvancorina, Phyllozoon, 
Praecambridium, Rugoconites, Tribrachidium and ‘old-elephant 
skin’ (ichnogenus Rivularites’). These fossils were preserved as fer- 
ruginous impressions, like plant fossils*, and biological soil crusts””® 
of Phanerozoic eon sandy palaeosols. Sand crystals after gypsum” 
and nodules of carbonate” are shallow within the palaeosols*, even 
after correcting for burial compaction”’. Periglacial involutions and 
modest geochemical differentiation of the palaeosols are evidence of 
a dry, cold temperate Ediacaran palaeoclimate in South Australia‘. 
This new interpretation of some Ediacaran fossils as large sessile 
organisms of cool, dry soils, is compatible with observations that 
Ediacaran fossils were similar in appearance and preservation to 
lichens and other microbial colonies of biological soil crusts’, rather 
than marine animals’, or protists”. 

Newly documented palaeosols in the Ediacara Member of the 
Rawnsley Quartzite in South Australia* now call for a re-evaluation 
of its famous fossils, widely considered evolutionary predecessors of 
the Cambrian explosion of marine animal phyla’. Ediacaran red beds 
of South Australia (Figs 1 and 2) were initially considered non-marine 
by Douglas Mawson and Ralph Segnit®. When Mawson’s student 
Reginald Sprigg discovered and interpreted South Australian Edia- 
caran fossils as marine jellyfish’, this palaeoenvironmental contradic- 
tion was resolved by a compromise interpretation of jellyfish thrown 
up onto tidal flats by storms’’. Ediacaran fossils are known worldwide 
in a variety of sedimentary facies'®, generally interpreted as shallow to 
deep marine, following Sprigg’s'* comparison with marine animals’, 
although such comparisons now seem increasingly doubtful. Most 
Ediacaran fossils have no clear relationship with modern animals**"*"”. 
Putative Neoproterozoic ‘embryos’ were more likely to have been 
protists’*. Putative permineralized metazoans may instead have been
crystal-lined vughs”’, and other permineralized Ediacaran fossils were 
red algae or glomeromycotan lichens”. Precambrian shallow trails may 
have been made by slime moulds in their slug aggregation phase rather 
than worms”’. There have also been suggestions that Ediacaran fossils 
were giant protists, such as xenophyopores’, or fungi, such as lichens’. 

Palaeosols in the Ediacara Member have been overlooked until now, 
because they are less strongly developed than palaeosols at Precambrian 
unconformities or formed under forests of the Devonian period and 
later*. In addition to obvious soil structures (platy peds) and horizons 
(A-Bk and A-By), bedding disruption on mainly microscopic scales 
contributes to the massive appearance of palaeosols compared with 
sedimentary rocks in the field (Fig. 2b-e). One bed (Warrutu palaeosol 
of Fig. 2b) has four distinct episodes of soft sediment deformation 


followed by successive weathering and bedding disruption of previous 
episodes, comparable with successive periglacial soil involutions, rather 
than seismic or load deformation*. Further evidence for Ediacaran 
palaeosols, detailed elsewhere* includes (1) geochemical mass-balance 
negative strain and cation mass transfer; (2) loessic grain-size distri- 
bution and texture; (3) unusually light carbon and oxygen isotopic 
compositions that show linear covariance; and (4) sand crystals of 
gypsum and micritic replacive nodules with a consistent depth from 
the tops of beds. The 47 different stratigraphic levels showing pedogenic 
features in Brachina Gorge (Fig. 1) are repetitions of five distinct types 
of palaeosol named as pedotypes from the Adnamatna indigenous 
language (see Supplementary Information, Supplementary Fig. 2 and 
Supplementary Tables 1 and 2). 

The red colour and weathering of rocks in the Flinders Ranges have 
been regarded as products of deep weathering from the Cretaceous 
period or later”’, but this view is falsified by several observations. Red 
beds of the Ediacara Member have been found beneath grey shales and 
limestones in drill cores in the Ediacara hills, where the cores have the 
same unusual and distinctive carbonate carbon isotopic composi- 
tion and major element composition as the outcrops in Brachina 
Gorge*. Grey sandstone palaeochannels of the Ediacara Member 
include red clasts redeposited from the Ediacara Member and Bonney 
Sandstone, as well as grey calcareous clasts from the Wonoka Forma- 
tion: all should be red if they were weathered downwards from the 
current land surface. A variety of clay crystallinity indices, X-ray dif- 
fraction data and microprobe analyses demonstrate that the Ediacara 
Member in outcrop and core is illite-chlorite that has been heated to 
low within the greenschist metamorphic facies* (see Supplementary 
Information, Supplementary Figs 3-6 and Supplementary Table 4). 
Furthermore, Ediacara Member sandstones contain abundant feldspar, 
and red siltstones include carbonate nodules. By contrast, deep weather- 
ing profiles contain no feldspar nor carbonate, and have kaolinite clays 
and a chemical composition strongly depleted in alkali and alkaline 
earth elements*. The red colour and degree of weathering do not dis- 
tinguish palaeosols from sediments, because both can be acquired 
from soils in source terrains, but the relative timing of reddening and 
weathering is crucial to a terrestrial-aeolian interpretation’. 

Palaeosols and fossils of the Ediacara Member are distinguished by 
a surface texture called old-elephant skin, which is best preserved 
under covering sandstone beds’. Rivularites repertus is a validly named 
ichnospecies for comparable cracked and pustulose surfaces’. What 
makes it look old is healed cracks, irregular fine ridges (cracks in cover 
sandstone), and pustulose relief of intergrown radial growth centres 
(Fig. 2a). These distinctive cracked and pustulose surfaces have a 
variety of features that are more like the biological soil crusts of desert 
and tundra”’® than the parallel-wrinkled, and undulose hydrated 
microbial mats of intertidal flats and shallow seas’. Biological soil 
crusts and their soils have vertically oriented organisms intimately 
admixed with minerals of the soil, whereas aquatic microbial mats 
are laminated, and detachable from their mineral substrate as flakes, 
skeins and rollups, not seen in the Ediacara Member. Soil crusts have 
irregular relief, healed desiccation cracks and pressure ridges even in 
clay-poor sandstones, whereas microbial mats have flexuous, striated 
domes and tufts, again not seen in the Ediacara Member. Soil crusts are 


¹Department of Geological Sciences, University of Oregon, Eugene, Oregon 97403-1272, USA.

Figure 1 | Geological section of upper Ediacara Member in Brachina Gorge,
South Australia. The interpreted palaeosol position and development (height 
and width of black boxes, respectively*) are shown. Calcareousness assessed in 
field by degree of reaction with 10% stock HCl. Hues such as 10YR and 10R are 
from a Munsell chart. This was the entire Ediacara Member as originally 
defined® in the Brachina Gorge (31.34422° S, 138.55763° E). 


the upper part of deeper soil profiles with downward variation in 
oxidation, clay abundance and replacive nodular subsurface horizons 
like palaeosols of the Ediacara Member, whereas microbial mats form 
caps to unweathered, chemically reduced sedimentary layers. Soil 
crusts develop increasingly differentiated soil profiles through time,
whereas microbial mats build upwards in laminar to domed (stroma-
tolitic) increments. Sandy palaeosols with impressions of lichen-like 
fossils are also known from Ordovician’ and Cretaceous palaeosols’”. 
Comparable preservation of vascular land plants is well known in red 
sandy palaeosols of Cretaceous age’. 

Ediacaran fossils were preserved as impressions in old-elephant 
skin sandstones overlying four of the five different kinds of palaeosol 
at ten different horizons in four classic localities (see Supplementary 
Information and Supplementary Table 3). The best place to see 
Ediacaran fossils in place above a palaeosol is the overhang at the 
39.7 m level in Brachina Gorge (Fig. 2b), where there are still good 
specimens of Dickinsonia costata and Pseudorhizostomites howchini. 
The body fossils were firmly attached or embedded within these soil 
surfaces in life, as revealed by growth series, lack of overlapping speci- 
mens, and thickening of adjacent specimens comparable with compe- 
titive reaction*””. Some taxa such as Phyllozoon hanseni and Aulozoon 
sp. are embedded within the surface layer, like window lichens and 
rhizines in desert crusts*. In petrographic thin sections, branching tubu- 
lar structures extend deep into the palaeosols (Fig. 2c-g), like cyano- 
bacterial ropes, fungal hyphae and lichen rhizines of desert crust soils 
today’. Petrographic thin sections of the lower half of impressions of 
Dickinsonia fossils show comparable bedding disruption by irregular 
tubular features (Fig. 2c, d): the more common upper impression fossils 
show only bedded sandstone overlying the fossil (Fig. 2e). Comparable 
bifacial fossil features, with smooth and finished upper surfaces but 
ragged lower surfaces, were also found in surface horizons of the palaeo- 
sols (Fig. 2e-g). Which specific Ediacaran fossils are represented by 
these thin sections is uncertain, because Ediacaran taxa have been 
defined by shape, not appearance in thin section. These images of com- 
plex chambered structures with basal tubules (Fig. 2c-g) are preliminary 
indications of their appearance in thin section prepared for a detailed 
study in progress. Observed cross-sections of Ediacaran fossils in petro- 
graphic thin sections are comparable in preservational style with plants 
and lichens in Phanerozoic palaeosols*””. 

Body fossils so far documented on the palaeosols include ‘Aulozoon’ 
sp. indet., Charniodiscus arboreus, ‘Cyclomedusa davidi’, Dickinsonia
costata, D. elongata, D. rex, Hallidaya brueri, cf. ‘Kimberella’ sp. indet.,
‘Medusinites asteroides’, Parvancorina minchami, Phyllozoon hanseni,
Praecambridium sigillum, Rugoconites enigmaticus, Spriggina floun- 
dersi and Tribrachidium heraldicum. Trace fossils found in the palaeo- 
sols include Archaeonassa sp. indet., Pseudorhizostomites howchini, cf. 
‘Radulichnus’ sp. indet. and Rivularites repertus (see Supplementary 
Information and Supplementary Table 3). Some palaeosols (Muru and 
Wadni pedotypes) have a diverse fossil assemblage dominated by 
Dickinsonia, whereas others (Yaldati and Inga) have a low-diversity 
assemblage mainly of discoid fossils (variously attributed to microbial 
colonies, medusoids or holdfasts, and taxa such as Medusinites and 
Cyclomedusa’). One palaeosol with deformation interpreted as peri- 
glacial involutions* (Warrutu pedotype) yielded no fossil specimens in 
outcrops, but may have Ediacaran fossils in thin sections (Fig. 2f, g). 
These differences in diversity may be evidence that Ediacaran orga- 
nisms preferred unfrozen, low salinity soils, rich in nutrients, like most 
terrestrial organisms. 

Not only fossil diversity, but fossil size may have increased with 
palaeosol development. The relationship between gypsum abundance 
(G in area (%)) and geological age (A in kyr) in the Sinai and Negev 
Deserts of Israel’*** is given by the following equation: 


A = 3.987G + 5.774 (1)


This relationship (R² = 0.95) has the standard error ± 15 kyr. The
largest Dickinsonia (32 cm long, D. rex'*) from the main collecting gully
in the Ediacara hills came from a Muru palaeosol with 10% gypsum,
and using equation (1), was 53.6 ± 15 kyr in the making. Long-term
growth of this D. rex (0.006 ± 0.002 mm yr⁻¹) would have been more
like modern lichens (Rhizocarpon geographicum) in the Brooks Range
Figure 2 | Palaeosols of the Ediacara Member of the Rawnsley Quartzite,
South Australia. a, Rivularites repertus (old-elephant skin) showing sutured 
radial growth, crack fills and ridge impressions, effaced discoid fossils (white 
circles) and fossil impressions (Hallidaya brueri in positive relief to left, and 
Rugoconites enigmaticus in negative relief to right), on sole of sandstone slab 
from Crisp Gorge (31.176572° S, 138.328533° E). b, Palaeosols and fossil levels 
in the Brachina Gorge (31.34422° S, 138.55763° E). c, d, Vertical petrographic 
thin sections of the lower part of D. elongata (c) and D. costata (d) showing 
bedding disruption and tubular features, from Muru palaeosol in the Ediacara 
hills (c) and unknown palaeosol at the Hookapunna well (d). e-g, Unidentified 


of northern Alaska (0.04 mm yr⁻¹) than lichens in southern Alaska?’
(0.1 mm yr⁻¹) or southern Norway”? (0.5-0.7 mm yr⁻¹). Correlation
between the largest Dickinsonia found within a collection and gypsum 
enrichment in the palaeosol of that collection is evidence for slow non- 
linear growth rates of Dickinsonia (Fig. 3, Supplementary Information 
and Supplementary Table 3). The Ediacara hills has yielded one of the 
most diverse assemblages known, and Ediacaran organisms may have 
diversified as well as grown with soil age, comparable with modern 
terrestrial communities. 
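
The arithmetic behind the age and growth-rate estimates can be sketched in Python. The function names are hypothetical. Evaluated as printed, equation (1) gives about 45.6 kyr for G = 10% and 53.6 kyr for G = 12%, so the sketch takes the reported age of 53.6 ± 15 kyr as its input for the growth rate and does not recompute it.

```python
def palaeosol_age_kyr(gypsum_area_percent):
    """Equation (1): A = 3.987 G + 5.774, with A in kyr and G in area %."""
    return 3.987 * gypsum_area_percent + 5.774


def growth_rate_mm_per_yr(length_mm, age_kyr):
    """Long-term growth rate if the specimen grew for the whole soil age."""
    return length_mm / (age_kyr * 1000.0)


AGE_ERROR_KYR = 15.0   # standard error quoted for equation (1)
LENGTH_MM = 320.0      # largest Dickinsonia, 32 cm long
REPORTED_AGE_KYR = 53.6

rate = growth_rate_mm_per_yr(LENGTH_MM, REPORTED_AGE_KYR)
rate_fast = growth_rate_mm_per_yr(LENGTH_MM, REPORTED_AGE_KYR - AGE_ERROR_KYR)
rate_slow = growth_rate_mm_per_yr(LENGTH_MM, REPORTED_AGE_KYR + AGE_ERROR_KYR)

print(f"rate = {rate:.4f} mm/yr (range {rate_slow:.4f} to {rate_fast:.4f})")
```

The range that follows from the ± 15 kyr error is asymmetric (roughly 0.005 to 0.008 mm yr⁻¹), which is consistent with the rounded 0.006 ± 0.002 mm yr⁻¹ quoted in the text.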

Indications of palaeoclimate come from comparison of palaeosols in 
the Ediacara Member with modern soils. The closest modern analogue 
to Ediacaran gypsic palaeosols seems to be soilscapes on the coastal 
plain of the Caspian Sea near Atyrau, Kazakhstan (mean annual tem- 
perature 8 °C, mean annual precipitation 160 mm: map unit Zo16-3a of 
Orthic Solonchak, with associated Takyric and Gleyic Solonchaks”’). 
Calcic palaeosols are found nearby in the Emba River floodplain (map 
units X1 16-lab and Jc 53-2c). Periglacial involutions in one palaeosol* 
support other evidence from dropstones for cold temperate palaeocli- 
mate in the coeval Billy Springs Formation of the far northern Flinders 
Ranges”. Other indications of cool and dry palaeoclimate from climo- 
functions for modern soils are outlined in the Supplementary Informa- 
tion, Supplementary Fig. 7 and Supplementary Tables 5 and 6. The 
zone of best preservation of large Ediacaran fossils in the Flinders
dorsoventral Ediacaran fossils in thin sections showing overlying cross-bedded
sandstone (e) and basal irregular tubular structures 

(e-g). The fossiliferous surface in a is part of a large slab on display in the South 
Australian Museum, Adelaide. The hammer for scale in b has a length of 25 cm. 
Specimen numbers in the Condon Collection, Museum of Natural and Cultural 
History, University of Oregon are F112999 from Muru palaeosol in the
Ediacara hills (c), F115736 from unknown palaeosol near the Hookapunna well 
(d), R3218 from Muru palaeosol in the Brachina Gorge (e), and both R3223 
(f) and R3222 (g) are from the Warrutu palaeosol in the Brachina Gorge. All 
thin sections were cut vertical to regional bedding. 


Ranges is thus near the gypsic-calcic soil ecotone, an important biotic 
boundary in modern temperate deserts, such as the Atacama” and 
Negev” Deserts. 

Figure 3 | Maximum length of Dickinsonia fossils related to area (%) of
gypsum in same palaeosol, as a proxy for soil development. Measurements of
the longest specimen from the largest collections are the most reliable (see
Supplementary Information for data).

Discovery of some Ediacaran fossils in the surface horizons of
palaeosols does not mean that all Ediacaran fossils everywhere were
terrestrial. Intertidal to shallow marine facies recognized in the Ediacara
Member’, have so far proven unfossiliferous, but Palaeopascichnus is 
known from both old-elephant skin surfaces of the Ediacara Member 
interpreted here as terrestrial, as well as in pyritic black shales of the 
Wonoka Formation interpreted as marine’. Multisegmented fossils 
from palaeosols of the Ediacara Member such as Dickinsonia, Charnia, 
Praecambridium and Spriggina are more likely to have been lichens or 
other microbial consortia* than marine invertebrates’ or giant protists’. 
Discoid Ediacaran fossils such as Cyclomedusa, Medusinites and Rugoconites
would not be jellyfish in such dry soils, but could have been
microbial colonies**. Small fossils such as Parvancorina or Tribrachidium
could not have been pre-trilobites or proto-sea-stars’, respectively,
if they lived on land, but may have been fungal fruiting bodies*. Trace
fossils such as Archaeonassa could have been created by metazoan slugs 
or worms after rainstorms on land, but terrestrial habitats also open the 
possibility that these trails were created by slug-aggregating phases of 
slime moulds”. ‘Radulichnus’ impressions from the Ediacara Member 
are too straight and sharp to be molluscan radular scratches’, and in cool 
temperate soils may instead have been casts of needle ice. Pseudorhizo- 
stomites has been considered a gas-escape structure in a marine setting”®, 
but as a soil-surface feature it is most like a flanged pedestal of a biological 
soil crust’. These surprising alternative terrestrial hypotheses for habitats 
and affinities of these enigmatic fossils arise largely from recognition of 
palaeosols, and their interpretation by comparison with modern soils and 
soil processes. These unconventional ideas and comparisons remain to be 
tested for different kinds of Ediacaran fossil, and in sequences and assem- 
blages of Ediacaran fossils beyond South Australia. 


METHODS SUMMARY 


The main contribution of this Letter is to document the geographic and stra- 
tigraphic occurrence of Ediacaran fossils in palaeosols described in detail else- 
where’, as well as new observations of Ediacaran fossils in petrographic thin 
sections. Further analytical data on these palaeosols are provided in the 
Supplementary Information and Supplementary Tables 4-6, and interpretation 
of their palaeoenvironmental setting is in Supplementary Tables 1, 2 and 7-9 
and Supplementary Figs 2 and 7.


Convergent acoustic field of view in echolocating bats


Lasse Jakobsen¹, John M. Ratcliffe¹ & Annemarie Surlykke¹


Most echolocating bats exhibit a strong correlation between body 
size and the frequency of maximum energy in their echolocation 
calls (peak frequency), with smaller species using signals of higher 
frequency than larger ones’. Size-signal allometry or acoustic 
detection constraints imposed on wavelength by preferred prey 
size have been used to explain this relationship’’. Here we propose 
the hypothesis that smaller bats emit higher frequencies to achieve 
directional sonar beams, and that variable beam width is critical 
for bats. Shorter wavelengths relative to the size of the emitter 
translate into more directional sound beams‘. Therefore, bats that 
emit their calls through their mouths should show a relationship 
between mouth size and wavelength, driving smaller bats to signals 
of higher frequency. We found that in a flight room mimicking a 
closed habitat, six aerial hawking vespertilionid species (ranging in 
size from 4 to 21 g, ref. 5) produced sonar beams of extraordinarily 
similar shape and volume. Each species had a directivity index of 
11 ± 1 dB (a half-amplitude angle of approximately 37°) and an
on-axis sound level of 108 ± 4 dB sound pressure level referenced
to 20 µPa root mean square at 10 cm. Thus all bats adapted their
calls to achieve similar acoustic fields of view. We propose that the 
necessity for high directionality has been a key constraint on the 
evolution of echolocation, which explains the relationship between 
bat size and echolocation call frequency. Our results suggest that 
echolocation is a dynamic system that allows different species, 
regardless of their body size, to converge on optimal fields of view 
in response to habitat and task. 

For echolocating bats, peak frequency in echolocation calls is nega- 
tively related to body size, a trend attributed to allometry'’. However, 
similarly sized birds, anurans and most mammals use much lower 
frequencies for communication®’, so allometry does not adequately 
explain why bats use such high frequencies for echolocation. Indeed, 
atmospheric attenuation® increases rapidly with frequency, and echo- 
location range is much shorter than it would be at lower frequencies’. 
Thus, there must be some functional explanation why bats, especially 
smaller species, use such high-frequency sonar signals. 

One hypothesis proposes that because small bats hunt small prey 
they use wavelengths short enough to be effectively reflected from their 
smallest quarry. However, this does not adequately explain bats’ high- 
frequency calls, as it presumes that strong echoes will only be reflected 
from objects with diameters equal to or greater than the wavelength, 
which is not true. To reflect sound efficiently, the radius, a, of an object
relative to the wavelength, λ, of the impinging sound has to fulfil 2πa/λ > 1.
Thus the effective diameter of the object only has to be greater
than approximately λ/3 (ref. 10). Consequently, a 6-mm-diameter insect
will reflect echoes efficiently at 20 kHz, which agrees well with data
showing that even small insects (4-5 mm) reflect strong echoes down
to 20 kHz"’. Diet analysis also reveals that many bats take prey with
wing lengths much shorter than the wavelength of their sonar calls*.
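
The reflection criterion 2πa/λ > 1 can be checked numerically with a short Python sketch. The function names are hypothetical, and the speed of sound (343 m s⁻¹, air at about 20 °C) is an assumed value that is not stated in the text.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at about 20 degrees C


def wavelength_m(frequency_hz, c=SPEED_OF_SOUND):
    """Wavelength in metres for a given frequency in hertz."""
    return c / frequency_hz


def reflects_efficiently(diameter_m, frequency_hz, c=SPEED_OF_SOUND):
    """True if the object satisfies 2*pi*a/lambda > 1, with a = diameter/2."""
    radius_m = diameter_m / 2.0
    return 2.0 * math.pi * radius_m / wavelength_m(frequency_hz, c) > 1.0


def min_frequency_hz(diameter_m, c=SPEED_OF_SOUND):
    """Lowest frequency meeting the criterion: f > c / (pi * diameter)."""
    return c / (math.pi * diameter_m)


# A 6-mm-diameter insect ensonified at 20 kHz
print(reflects_efficiently(0.006, 20000.0))
print(f"{min_frequency_hz(0.006) / 1000.0:.1f} kHz")
```

Under these assumptions the threshold for a 6-mm target falls a little below 20 kHz, in line with the statement that such an insect reflects echoes efficiently at that frequency.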

We propose a new hypothesis to explain bat size-signal allometry. 
Specifically, that smaller bats are constrained to higher frequencies to 
achieve a sufficiently directional beam, because sound beams broaden 
with decreasing emitter size (Fig. 1). A directional sonar beam is critical 
for echolocators, focusing energy in a forward-directed cone and thus 


minimizing off-axis echoes and increasing on-axis intensity and therefore
range**"”.

For visual vertebrates, field of view (FOV) depends on skull design
and is relatively fixed'*. For echolocators, acoustic FOV is the volume 
ensonified sufficiently to generate detectable echoes’*'*. From FOV 
bats build and update their auditory world’®. Bats can adapt FOV 
(beam range and width) dynamically by changing: (1) call energy 
(range); (2) call frequency by laryngeal control (width); and (3) emitter 
size by gape control (width). Hence, the acoustic FOV of most bats 
should be more flexible than visual FOV and directionality an integral 
component of echolocation call design. In closed habitat, where range is 
not crucial, a broader beam provides a wider FOV, enhancing peri- 
pheral object detection. We predict that morphologically similar bats 
orienting in the same habitat will produce sonar signals of uniform 
beam shape, converging on an optimal FOV. 

Vespertilionidae comprises one-third of extant bat species and exhi- 
bits pronounced negative signal frequency to body size scaling’. We 
tested our predictions using six vespertilionid species of similar face and 
ear morphology, Pipistrellus pygmaeus, Myotis daubentonii, Vespertilio 
murinus, Myotis dasycneme, Eptesicus serotinus and Nyctalus noctula, 
that range in size from 4 to 21 g and produce calls over open field with 
peak frequencies of 20-55 kHz (Supplementary Table 1). We used a 


Figure 1 | Sonar beam width decreases as emitter (bat) size increases relative 
to wavelength. The beam width cartoons (three-dimensional figures) illustrate 
that for constant energy and emitter size, an increase in frequency (left to right), 
that is decrease in wavelength, focuses the energy in a sonar beam, to become 
narrower but longer, which at short distances counteracts the decrease in sonar 
range due to increased atmospheric attenuation at higher frequencies. The 
smaller the bat, the smaller is its emitter (mouth) size, and thus the broader its 
beam for constant frequency. Hence the smaller the bat, the higher the 
frequency required to maintain directionality of the biosonar beam. 


¹Sound Communication Group, Institute of Biology, University of Southern Denmark, DK-5230 Odense M, Denmark.

Figure 2 | Vertical and horizontal directionality in six species of
vespertilionid bats. Vertical (left) and horizontal (right) directionality for 

P. pygmaeus (half-amplitude angle (HA) = 36°), M. daubentonii (HA = 40°), 
V. murinus (HA = 37°), M. dasycneme (HA = 40°), E. serotinus (HA = 34°) 
and N. noctula (HA = 34°). Marker colour corresponds to font colour of each 
species’ name. Dashed black line indicates the much narrower directionality of 


multi-microphone array to determine sonar beam directionality as bats 
oriented in a large, screened flight room, and estimated their acoustic 
emitter size using the piston model!: 
$$R_P(\theta) = \frac{2\,J_1\!\left(k\,a\,\sin\theta\right)}{k\,a\,\sin\theta}$$

where R_P(θ) is the ratio between the pressure on-axis and at a given
angle θ, J_1 is a first-order Bessel function of the first kind, k = 2π/λ, λ is
wavelength, and a is piston radius.
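
As an illustration of how the piston model links emitter radius and frequency to beam width, the following minimal Python sketch evaluates the ratio above and searches for the off-axis angle at which it falls to one half. The speed of sound (343 m/s), the example radius and frequency, the function names and the reading of "half-amplitude angle" as the −6 dB off-axis angle are assumptions made for this sketch; it is not the analysis code used in the study.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at about 20 degrees C


def bessel_j1(x, terms=40):
    """First-order Bessel function of the first kind via its power series.

    Adequate for the small arguments used here (x below roughly 20).
    """
    total = 0.0
    for m in range(terms):
        total += ((-1) ** m / (math.factorial(m) * math.factorial(m + 1))
                  * (x / 2) ** (2 * m + 1))
    return total


def piston_ratio(theta_rad, radius_m, freq_hz, c=SPEED_OF_SOUND):
    """Pressure at angle theta relative to on-axis pressure (piston model)."""
    k = 2 * math.pi * freq_hz / c          # wavenumber, k = 2*pi/lambda
    x = k * radius_m * math.sin(theta_rad)
    if abs(x) < 1e-12:                     # limit of 2*J1(x)/x as x -> 0
        return 1.0
    return 2 * bessel_j1(x) / x


def half_amplitude_angle_deg(radius_m, freq_hz, c=SPEED_OF_SOUND):
    """Off-axis angle (degrees) where the ratio drops to 0.5, by bisection.

    Returns None when the beam never falls to half amplitude within 90 degrees.
    """
    lo, hi = 0.0, math.pi / 2
    if piston_ratio(hi, radius_m, freq_hz, c) > 0.5:
        return None
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if piston_ratio(mid, radius_m, freq_hz, c) > 0.5:
            lo = mid
        else:
            hi = mid
    return math.degrees(0.5 * (lo + hi))


# Hypothetical emitter: 4 mm radius at 50 kHz.
print(half_amplitude_angle_deg(0.004, 50e3))
```

Under these assumptions a 4 mm radius at 50 kHz gives a half-amplitude angle close to 37°; halving the frequency at the same radius would broaden the beam, which is the trade-off the text describes.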

We measured directionality over a full octave band starting at
−6 dB down from peak (Fig. 2). All bats emitted calls with similar
beam widths. Half-amplitude angles were 37 ± 3° (Fig. 2). Call intensities
also converged across species, at 108 ± 4 dB root mean square
sound pressure level at 10 cm (Supplementary Table 1). To quantify
measured beam patterns we computed the directivity index of each call 
(Fig. 3). The directivity index (DI) compares on-axis sound pressure 
with the sound pressure of an omnidirectional emitter producing a 
signal of equal energy. Calculated DI values (10.7-12.1 dB) confirm 
directionality as nearly identical across species despite differences in 
emitter size and frequency. Maximum inter-specific DI differences of 
1.4 dB (Fig. 3) are negligible under ecologically relevant conditions. 
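
The definition of the directivity index can be made concrete with a short numerical sketch. The Python code below integrates the squared piston-model beam pattern over the forward hemisphere and compares it with an omnidirectional emitter of equal energy. Treating the emitter as a baffled piston radiating only into the front hemisphere, the speed of sound, and the example radius and frequency are all assumptions of this sketch rather than details taken from the study.

```python
import math


def bessel_j1(x, terms=40):
    """First-order Bessel function of the first kind via its power series."""
    total = 0.0
    for m in range(terms):
        total += ((-1) ** m / (math.factorial(m) * math.factorial(m + 1))
                  * (x / 2) ** (2 * m + 1))
    return total


def piston_directivity_index_db(radius_m, freq_hz, c=343.0, steps=4000):
    """Directivity index (dB) of a piston radiating into the front hemisphere.

    DI = 10*log10(4*pi / integral of R^2 over solid angle), where R is the
    piston-model pressure ratio. The integral is evaluated with the midpoint
    rule over theta in [0, pi/2].
    """
    ka = 2 * math.pi * freq_hz / c * radius_m
    dtheta = (math.pi / 2) / steps
    integral = 0.0
    for i in range(steps):
        theta = (i + 0.5) * dtheta
        x = ka * math.sin(theta)
        ratio = 1.0 if x < 1e-12 else 2 * bessel_j1(x) / x
        integral += ratio * ratio * math.sin(theta) * dtheta
    # 4*pi / (2*pi * integral) simplifies to 2 / integral.
    return 10 * math.log10(2.0 / integral)


# Hypothetical emitter: 4 mm radius at 50 kHz.
print(piston_directivity_index_db(0.004, 50e3))
```

For the hypothetical 4 mm, 50 kHz emitter the sketch returns a DI a little above 11 dB, within the 10.7-12.1 dB range reported above. A very small emitter approaches 3 dB, the value for a uniform hemisphere.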

These similar DI values are striking because the range of DI values 
available to individual vespertilionid bats is greater than the inter- 
specific variation we observed in the flight room. M. daubentonii emits 
a narrower beam, with a DI of 16 dB, by opening its mouth wider over 
open field’. In the last phase of an aerial attack M. daubentonii and 
E. serotinus lower their call frequency an octave, emitting broader beams 
(DI of 6 dB)’ (Fig. 3). Our data indicate that vespertilionids actively 
control directionality, adjusting emitter size and frequency, to converge 
on the same beam width. In closed habitats like our flight room, sonar 
M. daubentonii when flying in open field, whereas grey lines in both plots 
indicate the broad directionality M. daubentonii uses in the terminal phase of 
the prey pursuit. The bottom panel shows spectrograms of the calls emitted in 
the flight room by the six species in order of size from left to right. The inset in 
the top right corner shows a spectrum of a M. daubentonii call indicating the 
one octave band-pass filter starting at the −6 dB low-frequency cut-off. 
range is not an issue and a broader beam provides peripheral infor- 
mation optimal to this habitat and task. Conversely, increasing range 
takes precedence while over open field, where, as demonstrated for M. 
Figure 3 | Directivity index for six species of vespertilionid bats with 
forearm lengths ranging from 32 to 54 mm. Directivity indices (DI) (± s.d.) 
calculated from best fit of the piston model for total energy of the first harmonic 
(1 octave filter, see Fig. 2) of each species’ echolocation call (n = 1–3 per 
species). DI compares the on-axis sound pressure with that of an 
omnidirectional emitter producing a signal of the same total energy, for which 
DI = 0 dB. The larger the DI, the higher the directionality. M. daubentonii 
emits more directional search calls in the field with DI of 16 dB (‘field’). M. 
daubentonii and E. serotinus emit broader beams with directivity indices of 6 dB 
in the last phase of the pursuit, the buzz II (‘buzz’). The dotted line represents 
the mean of the DI values. 
daubentonii', vespertilionid bats should emit more directional beams. 
Indeed, larger bats emit lower peak frequencies in open field, in- 
creasing range; in closed habitat, bandwidth is increased to include 
higher frequencies, enhancing resolution and temporal accuracy”. 
The two smallest species, particularly P. pygmaeus, increased peak fre- 
quency only slightly in the flight room, suggesting that, when orienting, 
smaller species regulate directionality almost exclusively by adjusting 
emitter size (Fig. 4). 

To relate our acoustic models to actual species-specific mouth sizes 
(maximum gape) we measured upper and lower jaw length (cranio- 
mandibular joint to front teeth) and the ratio between distance from 
the craniomandibular joint to the origin (A) and insertion (B) of the 
superficial masseter (Fig. 4). The longer A is relative to B, the larger the 
maximal gape angle’*. Assuming that inter-specific differences in A/B 
relate directly to differences in gape angle, we estimated gape height 
using a reported gape angle of 90° for M. lucifugus'? and a measured 
A/B of 2.1. We used upper jaw width at the second incisors and third 
molars to estimate species-specific differences in maximum gape width 
(Supplementary Table 1). 

To verify emitter size estimates from acoustic data, we compared 
them to actual measured data on mouth opening. Flying M. daubentonii 
had distances between the upper and lower lip of 5-8 mm (Supplemen- 
tary Fig. 1), closely matching acoustic data (Fig. 4 and Supplementary 
Table 1). Subsequently, we compared emitter size estimates and mor- 
phometric measures and also compared these data to acoustic data at 
the species-level and using independent contrasts” (see Methods for 
further details). 

Maximum gape height and width from skull measurements were 
comparable, but always larger than vertical and horizontal emitter size 
based on the piston model, suggesting that bats did not open their 
mouths to the maximum (Fig. 4). Gape estimates based on skull mea- 
sures better predicted open space peak frequency than body mass 
Figure 4 | Gape size estimated from skulls and from the piston model. From 
dry museum specimens we measured the distance from the craniomandibular 
joint to both the origin (A) and the insertion (B) of the superficial masseter 
muscle. The ratio between A and B relates directly to gape angle and was used 
together with upper and lower jaw length to estimate maximum vertical gape 
opening (black squares; see Methods). We also estimated gape diameter from 
the acoustic data using the piston model (blue squares). Gape sizes based on 
sonar sound beams were comparable to, but always below, maximum gape size 
based on skull metrics, indicating that the bats did not open their mouths to the 
maximum gape opening when flying in the flight room. Peak frequencies of 
echolocation calls in the flight room (red asterisks) were always higher than 
typical peak frequencies of the same species flying in the field (red diamonds). 
(Supplementary Table 2). Similarly, emitter size estimates fitted well 
with second incisor distances (Supplementary Table 2). Forearm 
length correlated with open space peak frequency and maximum esti- 
mated gape height and width (Fig. 4 and Supplementary Table 2), as 
predicted by allometry and flight speed. Relationships between maxi- 
mum estimated gape size and differences between open space and 
flight room frequencies also suggest that larger bats were using less 
than maximum gape in the flight room (Fig. 4 and Supplementary 
Tables 1 and 2). 

We found that when orienting in the same context, six species of 
vespertilionid bats produce sonar beams with ~37° half-amplitude 
angles despite species-specific differences in frequency and emitter 
size. Over open field, larger, faster flying bats require longer detection 
ranges than smaller slower species to equally sample the distance they 
travel between calls (Supplementary Table 1). Greater range favours 
lower frequencies because of reduced transmission loss through atmos- 
pheric attenuation’. If larger bats lower peak frequency proportionally 
more than smaller bats, they must open their mouths wider to maintain 
beam width. 

We propose that the requirement for a directional beam has driven 
the high frequencies of bat echolocation calls. Narrowing the beam 
focuses its energy and partially compensates for increased attenuation 
at higher frequencies, such that short ranges are not decreased for a 
given energy output (Supplementary Fig. 2). In the field, flight speed 
and call duration are proportional to bat size’; in our flight room all 
species flew at roughly the same speed (relative to the range for open 
field) and used similar call intensities and durations, with no clear 
correlation to bat size (Supplementary Table 1). Taking into account 
features defining the spatial filter formed by the sonar beam (call 
duration, intensity, beam width and flight speed), the six species 
sampled almost identical volumes. For open space, flight speed was 
positively related to intensity and negatively related to frequency 
(Supplementary Table 2) suggesting FOV is adjusted such that larger, 
faster bats monitor greater distances than smaller, slower bats. 

Our results support our prediction that sonar beam width and 
acoustic FOV are dynamically controlled to best monitor a particular 
environment. We believe that the high frequencies emitted by bats are 
largely dictated by FOV, which acts as an evolutionary constraint on 
echolocation call design™. The receiving side, that is directional hearing, 
will also influence the bat’s auditory perception”®, but the fact that bats 
adjust the beam to be narrower in open field’* and broader in the last 
phase of pursuit'* demonstrates that directionality on the receiving side 
does not diminish the importance of the outgoing acoustic FOV. 
Smaller bats typically have shorter, more gracile jaws and skulls and 
therefore smaller maximum emitter sizes. This forces smaller bats to 
use high echolocation call frequencies not because of preferred prey 
size or body size per se, but to obtain a directional beam over open 
field. The lack of correlation between bat size and signal frequency in 
phyllostomid bats’ is consistent with such a view. Phyllostomids are 
nostril-emitting echolocators and sonar beam width depends on nose- 
leaf dimensions, which do not scale with body size. Consequently, 
phyllostomids are not under the constraints resulting in size-frequency 
scaling in mouth-emitting species. Overall, our results depict bat echo- 
location as not only a viable substitute for vision under conditions of 
uncertain lighting, but as having unique advantages owing to flexible 
control of field of view. 


METHODS SUMMARY 


We recorded one Eptesicus serotinus, three Myotis dasycneme, three M. daubento- 
nii, two Nyctalus noctula, three Pipistrellus pygmaeus and two Vespertilio murinus 
as they oriented in a 7 × 4.8 × 2.5 m flight room. Calls were recorded with twelve 
0.25 inch 40BF G.R.A.S. microphones, amplified by 12 AA G.R.A.S. amplifiers, and 
sampled at 350 kHz per channel by an Avisoft 1216 Ultrasound Gate. Seven micro- 
phones were positioned horizontally 50 cm apart and five were positioned vertically 
40 cm apart (two above and three below the central horizontal microphone). 

We localized bats at each call using microphone arrival-time differences, com- 
pensating for transmission loss (spherical-spreading loss, atmospheric attenuation® 
and angle on microphones”) by filtering the recorded signal by the impulse response 
of transmission loss. We filtered each call using a one-octave sixth-order 
band-pass filter (ANSI-S1.1-1986-Standard) starting at the −6 dB low-frequency 
cut-off (Fig. 2 and Supplementary Table 1), which weights directionality according 
to call energy distribution. We also calculated maximum beam width at minimum 
frequency (−6 dB low-frequency cut-off) and found similar directionality indices 
(8.9–10.4 dB). Root mean squared pressure of each compensated signal was calculated 
for 95% energy content of 7.5 ms inclusive segments. 
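
To make the idea of transmission-loss compensation concrete, here is a deliberately simplified Python sketch that refers a received level back to a reference distance by adding spherical-spreading loss and a single-frequency atmospheric absorption term. The study filtered each signal with the impulse response of the transmission loss; this scalar version, the function name, the 0.1 m reference distance and the absorption coefficient values are assumptions for illustration only.

```python
import math


def compensate_transmission_loss_db(received_db, distance_m,
                                    alpha_db_per_m, ref_distance_m=0.1):
    """Estimate the level at ref_distance_m from a level received at distance_m.

    received_db     level measured at the microphone (dB SPL)
    distance_m      bat-to-microphone distance (m)
    alpha_db_per_m  atmospheric absorption at the frequency of interest
                    (dB per metre); a hypothetical input, not a study value
    """
    spreading_db = 20 * math.log10(distance_m / ref_distance_m)
    absorption_db = alpha_db_per_m * (distance_m - ref_distance_m)
    return received_db + spreading_db + absorption_db


# Hypothetical example: 80 dB SPL received at 2 m with 1 dB/m absorption.
print(compensate_transmission_loss_db(80.0, 2.0, 1.0))
```

Because absorption rises with frequency, a broadband call needs this correction applied per frequency, which is what filtering with an impulse response achieves in one step.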

Beam aim was calculated in azimuth and elevation, using calls on-axis with the 
centre microphone. We fitted the piston model to emission pattern extracting 
equivalent piston radius, a, for each call. 

We took gape heights for two flying M. daubentonii from the open-access 
repository http://www.ChiRoPing.org/data/index. Bats were video recorded at 
500 frames per second and vertical gape heights measured. Eight flights per indi- 
vidual were analysed (Supplementary Fig. 1). 

Skull photos (n = 4–7 per species) were exported to ImageJ v.1.38x, and measurements 
were made as described in ref. 18. We estimated gape height as: 


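$$\text{Gape height}_{\text{species}} = \sqrt{a^{2} + b^{2} - 2\,a\,b\,\cos\!\left(90^{\circ} \times \frac{(A/B)_{\text{species}}}{(A/B)_{\text{lucifugus}}}\right)}$$

where a is upper jaw length, b is lower jaw length, and A/B is the ratio between distance from 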
craniomandibular joint to origin and insertion of the superficial masseter (Fig. 4). 
Forearm measurements were taken from wet specimens (n = 10 per species). 
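
The gape-height estimate is an application of the law of cosines, with the gape angle scaled by the species' A/B ratio relative to M. lucifugus. A minimal Python sketch follows; the function and parameter names are hypothetical, and using 2.1 as the M. lucifugus A/B ratio is this sketch's reading of the main text.

```python
import math


def gape_height(upper_jaw, lower_jaw, ab_species,
                ab_reference=2.1, reference_gape_deg=90.0):
    """Estimate maximum vertical gape from jaw lengths and the A/B ratio.

    The gape angle is taken to scale linearly with A/B relative to the
    reference species, and the gape height is the side of the triangle
    opposite that angle (law of cosines). Output units match the jaw lengths.
    """
    gape_angle = math.radians(reference_gape_deg * ab_species / ab_reference)
    return math.sqrt(upper_jaw ** 2 + lower_jaw ** 2
                     - 2 * upper_jaw * lower_jaw * math.cos(gape_angle))


# Hypothetical jaws of 3 and 4 units with the reference A/B ratio
# open to a right angle, so the gape height is 5 units.
print(gape_height(3.0, 4.0, 2.1))
```

A species with a smaller A/B ratio than the reference gets a proportionally smaller gape angle and therefore a smaller estimated gape height for the same jaw lengths.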

Phylogenetically independent contrasts were generated using combined 
molecular phylogenies’, actual branch lengths and the Crunch procedure 
(CAIC v.2.6.9, ref. 20). 


Received 7 June; accepted 9 October 2012. 
Published online 21 November 2012. 


1. Jones, G. Scaling of echolocation call parameters in bats. J. Exp. Biol. 202, 
3359-3367 (1999). 

2. Barclay, R. M. R. & Brigham, R. M. Prey detection, dietary niche breadth, and body 
size in bats: why are aerial insectivorous bats so small? Am. Nat. 137, 693-703 
(1991). 

3. Houston, R. D., Boonman, A. M. & Jones, G. Do echolocation signal parameters 
restrict bats’ choice of prey? In Echolocation in Bats and Dolphins (eds Thomas, J. A., 
Moss, C. F. & Vater, M) 339-345 (Chicago Univ. Press, 2004). 

4. Mogensen, F. & Møhl, B. Sound radiation patterns in the frequency domain of cries 
from a vespertilionid bat. J. Comp. Physiol. 134, 165-171 (1979). 

5. Dietz, C., von Helversen, O. & Nill, D. Handbuch der Fledermäuse Europas und 
Nordwestafrikas: Biologie, Kennzeichen, Gefährdung 1st edn (Kosmos, 2007). 

6. Bradbury, J. W. & Vehrencamp, S. L. Principles of Animal Communication 2nd edn 
(Sinauer Associates, 2011). 

7. Suthers, R. A. Vocal mechanisms in birds and bats: a comparative view. An. Acad. 
Bras. Cienc. 76, 247-252 (2004). 

8. ANSI standard S1. Method for the Calculation of the Absorption of Sound by the 
Atmosphere. Report No. ANSI S1 26-1995 (ASA 113-1995) Acoustic. Soc. Am. 
(1995). 

9. Lawrence, B. D. & Simmons, J. A. Measurements of atmospheric attenuation at 
ultrasonic frequencies and the significance for echolocation by bats. J. Acoust. Soc. 
Am. 71, 585-590 (1982). 

10. Pye, J. D. Is fidelity futile? The 'true' signal is illusory, especially with ultrasound. 
Bioacoustics 4, 271-286 (1993). 

11. Waters, D. A., Rydell, J. & Jones, G. Echolocation call design and limits on prey size: 
a case study using the aerial-hawking bat Nyctalus leisleri. Behav. Ecol. Sociobiol. 
37, 321-328 (1995). 


96 | NATURE | VOL 493 | 3 JANUARY 2013 


12. Surlykke, A., Pedersen, S. B. & Jakobsen, L. Echolocating bats emit a highly 
directional sonar sound beam in the field. Proc. R. Soc. B 276, 853-860 (2009). 

13. Land, M. F. & Nilsson, D. E. Animal Eyes 2nd edn (Oxford Univ. Press, 2012). 

14. Ghose, K. & Moss, C. F. Steering by hearing: a bat’s acoustic gaze is linked to its 
flight motor output by a delayed, adaptive linear law. J. Neurosci. 26, 1704-1710 
(2006). 

15. Jakobsen, L. & Surlykke, A. Vespertilionid bats control the width of their biosonar 
sound beam dynamically during prey pursuit. Proc. Natl Acad. Sci. USA 107, 
13930-13935 (2010). 

16. Moss, C. F., Chiu, C. & Surlykke, A. Adaptive vocal behavior drives perception by 
echolocation in bats. Curr. Opin. Neurobiol. 21, 645-652 (2011). 

17. Schnitzler, H.-U. & Kalko, E. K. V. Echolocation by insect-eating bats. Bioscience 51, 
557-569 (2001). 

18. Herring, S. W. & Herring, S. E. The superficial masseter and gape in mammals. Am. 
Nat. 108, 561-576 (1974). 

19. Kallen, F. C. & Gans, C. Mastication in little brown bat, Myotis lucifugus. J. Morphol. 
136, 385-420 (1972). 

20. Purvis, A. & Rambaut, A. Comparative analysis by independent contrasts (CAIC): 
an Apple Macintosh application for analysing comparative data. Comp. Appl. 
Biosci. 11, 247-251 (1995). 

21. Norberg, U. M. & Rayner, J. M. V. Ecological morphology and flight in 
bats (Mammalia; Chiroptera): wing adaptations, flight performance, 
foraging strategy and echolocation. Phil. Trans. R. Soc. B 316, 335-427 
(1987). 

22. Simmons, J. A., Fenton, M. B. & O'Farrell, M. J. Echolocation and pursuit of prey by 
bats. Science 203, 16-21 (1979). 

23. Baagøe, H. J. in Recent Advances in the Study of Bats (eds Fenton, M. B., Racey, P. & 
Rayner, J. M. V.) 57-74 (Cambridge Univ. Press, 1987). 

24. Jones, G. & Teeling, E. C. The evolution of echolocation in bats. Trends Ecol. Evol. 21, 
149-156 (2006). 

25. Wotton, J. M., Jenison, R. L. & Hartley, D. J. The combination of echolocation 
emission and ear reception enhances directional spectral cues of the big brown 
bat, Eptesicus fuscus. J. Acoust. Soc. Am. 101, 1723-1733 (1997). 

26. Brüel & Kjær. Condenser Microphones and Microphone Preamplifiers for Acoustic 
Measurements. Data Handbook (Brüel & Kjær, 1982). 

27. Ruedi, M. & Mayer, F. Molecular systematics of bats of the genus Myotis 
(Vespertilionidae) suggests deterministic ecomorphological convergences. 
Mol. Phylogenet. Evol. 21, 436-448 (2001). 

28. Hoofer, S. R. & Van den Bussche, R. A. Molecular phylogenetics of the chiropteran 
family Vespertilionidae. Acta Chiropterol. 5, 1-63 (2003). 

29. Hulva, P., Horacek, I., Strelkov, P. P. & Benda, P. Molecular architecture of 
Pipistrellus pipistrellus/Pipistrellus pygmaeus complex (Chiroptera: 
Vespertilionidae): further cryptic species and Mediterranean origin of the 
divergence. Mol. Phylogenet. Evol. 32, 1023-1035 (2004). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank H. Baagøe and B. Fenton for access to the bat collections 
at the Natural History Museum of Denmark and the Royal Ontario Museum, 
respectively, and R. Fisher and J. Hallam for providing gape heights of bats in flight. 
S. Brinkløv, C. Elemans, B. Falk, B. Fenton, J. Galef, P. Madsen, C. Moss, H.-U. Schnitzler 
and M. Wahlberg provided detailed comments that improved the manuscript. This 
study was funded by the Danish Council for Natural Sciences (FNU), Carlsberg, Oticon 
and the European Commission via the Seventh Framework Programme project 
ChiRoPing, Information Society Technologies Contract 215370. Animal capture and 
experimentation was approved by Skov- og Naturstyrelsen (Denmark). 


Author Contributions L.J. was responsible for conducting the experiments and 
programming. All authors contributed to data analyses and the writing of the 
manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to A.S. (ams@biology.sdu.dk). 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature11665 


Inhibition dominates sensory responses in the awake cortex 


Bilal Haider, Michael Häusser & Matteo Carandini 


The activity of the cerebral cortex is thought to depend on the pre- 
cise relationship between synaptic excitation and inhibition’. In 
the visual cortex, in particular, intracellular measurements have 
related response selectivity to coordinated increases in excitation 
and inhibition®°. These measurements, however, have all been 
made during anaesthesia, which strongly influences cortical state”® 
and therefore sensory processing”'"*°. The synaptic activity that is 
evoked by visual stimulation during wakefulness is unknown. Here 
we measured visually evoked responses—and the underlying syn- 
aptic conductances—in the visual cortex of anaesthetized and awake 
mice. Under anaesthesia, responses could be elicited from a large 
region of visual space’® and were prolonged. During wakefulness, 
responses were more spatially selective and much briefer. Whole- 
cell patch-clamp recordings of synaptic conductances”’’ showed a 
difference in synaptic inhibition between the two conditions. Under 
anaesthesia, inhibition tracked excitation in amplitude and spatial 
selectivity. By contrast, during wakefulness, inhibition was much 
stronger than excitation and had extremely broad spatial selectivity. 
We conclude that during wakefulness, cortical responses to visual 
stimulation are dominated by synaptic inhibition, restricting the 
spatial spread and temporal persistence of neural activity. These 
results provide a direct glimpse of synaptic mechanisms that control 
sensory responses in the awake cortex. 

To investigate how wakefulness affects the synaptic basis of visual 
selectivity, we made local field potential (LFP) recordings and whole-cell 
recordings of membrane potential (Vm) in layer 2/3 of the primary 
visual cortex (V1) in both anaesthetized and awake mice. 

We first examined spontaneous activity and found that this activity 
was markedly affected by wakefulness (Fig. 1a, b). Under two widely 
used anaesthetic regimes, slow fluctuations in both Vm and LFP’* 
were common (Fig. 1a and Supplementary Fig. 4). During wakefulness, 
these slow fluctuations were abolished and were replaced by 
higher frequency activity of both Vm and LFP (Fig. 1b and 
Supplementary Fig. 1). In nearly all cases, Vm was distributed 
unimodally during waking’’’® but bimodally during anaesthesia 
(Supplementary Fig. 1a, b). Spontaneous firing rates were similarly 
low in the two conditions (anaesthetized, 0.3 ± 0.2 spikes s⁻¹, 
n = 14; awake, 0.1 ± 0.1 spikes s⁻¹, n = 14; P = 0.07). These 
results indicate that in awake mice, V1 rarely shows the spontane- 
ous fluctuations that are common during anaesthesia or sleep’7° 
and that have been reported in area S1 of quietly awake mice’’. 

We next probed visual responses with flashed bars and found that 
wakefulness had a striking effect on response duration (Fig. 1c–f). 
Briefly flashed bars (100 ms duration, at 1.5 s intervals) elicited long-lasting 
LFP responses under anaesthesia (Fig. 1c; 553 ± 22 ms, n = 7 
mice) and much briefer responses during wakefulness (Fig. 1d; 
171 ± 11 ms, n = 7; P < 0.001). This striking difference in LFP response 
duration was observed across the depth of the cortex (Supplementary 
Fig. 2). Awake Vm responses were also rapidly truncated 
(Fig. 1h; 148 ± 31 ms, n = 14) compared with anaesthetized responses 
(Fig. 1g; 553 ± 73 ms, n = 14; and Supplementary Fig. 3). The 
prolonged responses in both Vm and LFP were remarkably similar 
across anaesthetic regimes and persisted regardless of the depth of 
anaesthesia (Supplementary Figs 4 and 5). 

This marked difference in awake and anaesthetized responses was 
not confounded by spontaneous alternations of excitability that are 
present during anaesthesia (Fig. 2). We asked whether responses under 
anaesthesia differed when neurons were spontaneously hyperpolarized 
(down) or depolarized (up)’. After correcting for the tendency 
of Vm to spontaneously alternate between these two states (Fig. 2c), we 
found that during anaesthesia, the Vm responses evoked from either 
state (hyperpolarized or depolarized) were remarkably similar in 
amplitude and duration (Fig. 2e) and were much longer than responses 
during wakefulness (Fig. 2f). 

Responses in awake mice were more selective across visual space than 
responses under anaesthesia (Fig. 3a–e). Vm responses were twice as 
spatially selective during wakefulness as under anaesthesia. This differ- 
ence in spatial localization was even more pronounced for spikes*! 
(Fig. 3e), even when we accounted for sustained responses during 
anaesthesia by restricting the spike counts to the earliest portion of 
the sensory response (0-200 ms; Supplementary Fig. 6). 

We observed fewer visually evoked spikes during waking than 
during anaesthesia (Fig. 3b). This difference was particularly evident 
during the stimulation of regions that surround the centre of the 
receptive field. Under anaesthesia, stimuli in these regions evoked 
significantly more firing (0.6 ± 0.2 spikes per trial) than did blank stimuli 
(0.3 ± 0.2 spikes per trial; P < 0.001). By contrast, in awake animals, 
stimuli in this region produced no net increase in spikes above 
spontaneous activity (0.08 ± 0.03 for surround stimuli versus 
0.08 ± 0.04 for blank stimuli; P = 0.89). 

The lower spike counts observed during wakefulness were also associated 
with a reduced variability in Vm responses (awake s.d., 
3.5 ± 0.03 mV, n = 14; anaesthetized s.d., 4.4 ± 0.03 mV, n = 14; 
P < 0.001). This lower variability reduced the number of threshold crossings 
of Vm in awake animals, despite peak responses that were, on 
average, more depolarized than those during anaesthesia (Fig. 3d). The 
spike threshold did not differ between wakefulness (−40.0 ± 1.1 mV) 
and anaesthesia (−40.0 ± 1.2 mV; P = 0.6). However, spikes evoked 
during wakefulness were quickly followed by a significant and long- 
lasting hyperpolarization (Fig. 3f). We hypothesized that this hyperpo- 
larization—and indeed the finer spatiotemporal resolution of the awake 
responses—was indicative of enhanced synaptic inhibition. 

To test this hypothesis, we blocked the intrinsic conductances”? in 
single neurons and recorded synaptic currents in voltage-clamp mode 
near the reversal potentials for glutamate-mediated excitation and 
GABA (y-aminobutyric acid)-mediated inhibition (Supplementary 
Fig. 7). We then estimated the relative change in total conductance 
(ΔG; Supplementary Figs 7 and 8) visible at the soma* in response 
to visual stimulation. Resting conductance and peak-evoked con- 
ductance were unaffected by wakefulness (Supplementary Fig. 7). 
Figure 1 | Spontaneous and evoked activity in 
the anaesthetized and awake visual cortex (V1). 
a, Vm and simultaneous LFP measured in V1 under 
anaesthesia. The spikes are shown truncated at 
+20 mV. b, Vm and simultaneous LFP measured in 
V1 in awake animals. c, d, Visually evoked LFP 
responses across space during anaesthesia (c) and 
wakefulness (d). An average of 15 trials per location 
was used. Top, the dashed line indicates the best 
location. Bottom, single trial responses (grey) and 
the average (mean ± s.e.m.) response (black or 
green) with the stimulus (stim) at the best location. 
deg, degrees. e, Vm responses to stimuli at the best 
location while under anaesthesia: single trials 
(grey) and mean (± s.e.m.; black). The spikes are 
shown truncated at −30 mV. f, Vm responses to 
stimuli at the best location during wakefulness. 
c–f, n = 4 different mice. g, h, Normalized 
probability distributions of Vm response durations 
to stimuli at the best location, across the population 
(n = 14), measured under anaesthesia (g) and 
during wakefulness (h). The arrow indicates the 
mean duration. 
Therefore, the differences between awake and anaesthetized Vm must 
result from changes in the relative strength of the excitatory conductance 
(ΔGe) and inhibitory conductance (ΔGi). 

Under anaesthesia, the estimated excitatory conductance and inhibitory 
conductance behaved as expected from previous studies'°'78172223.2526 
(Fig. 4a). On stimulation, increased excitation was 
quickly followed by inhibition. Then, excitation and inhibition co-varied 
at a sustained and elevated level for hundreds of milliseconds 
after the stimulus offset. In other words, excitation and inhibition were 
balanced in that they had a similar amplitude and time course. 
During wakefulness, by contrast, inhibition dominated excitation 
during the entire time course of the visual response (Fig. 4b). During 


Figure 2 | Anaesthetized responses are long 
lasting regardless of cortical state. 
a, b, Normalized probability distributions of 
spontaneous Vm during anaesthesia (n = 14 
neurons) (a) and wakefulness (n = 14 neurons) 
(b). c, Mean (± s.e.m.) anaesthetized Vm responses 
(solid lines), sorted by pre-stimulus Vm level. 
Depolarized (grey) and hyperpolarized (black) 
groups are shown (with the mean Vm of the two 
groups indicated by arrows of the corresponding 
colour in a). The pre-stimulus baseline Vm was 
subtracted before averaging. The dashed lines 
indicate the average spontaneous Vm (in response 
to blank stimuli), sorted similarly into two groups 
(n = 14 neurons). d, As for c, during wakefulness 
(n = 14). e, Average anaesthetized Vm response for 
hyperpolarized (black) and depolarized (grey) 
trials, after subtraction of spontaneous Vm traces. 
f, Average awake Vm response for all trials, after 
subtraction of spontaneous Vm traces. The grey line 
shows the average of all of the anaesthetized 
responses (for comparison). 
Figure 3 | Responses are spatiotemporally restricted during waking. a, The 
number of spikes evoked per trial (normalized to each neuron’s response at 
the best location). Symmetrical locations on either side of 0° were combined. 
The centre is defined as 0° ± 9°, and the surround is defined as ±18° to 
±45°. The response window is defined by the average duration of the 
population’s Vm response (Fig. 1g, h) (n = 14 for each group). b, Under 
anaesthesia (anaesth, black), centre and surround stimuli evoked more 
spikes (P < 0.001 for both) than during spontaneous activity (dashed line). 
During wakefulness (green), there were fewer spikes than under 
anaesthesia (P < 0.001 for both stimulus locations); the centre stimuli 
evoked more spikes than did the surround stimuli (P < 0.009), and the 
surround stimuli did not evoke a significantly different response from 
spontaneous activity. Nine of 14 neurons were active during anaesthesia, and 
5 of 14 were active during waking. c, As for a, for peak Vm responses 
(normalized to each neuron’s response at the best location). d, As for b, for 
peak Vm responses. The responses to centre stimuli were greater than to 
surround stimuli in both anaesthetized and awake animals (P < 0.04 for both 
conditions), and all responses were greater than spontaneous activity 
(dashed lines, P < 0.001 for all). The awake responses were larger than the 
anaesthetized responses (P < 0.001 for both stimuli locations). e, The Vm 
and the spike responses were more spatially selective during waking 
(Vm: anaesthetized, 0.3 ± 0.1; awake, 0.6 ± 0.1; P < 0.001; spikes: 
anaesthetized, 0.1 ± 0.1; awake, 0.6 ± 0.2; P < 0.001). f, The spike-triggered 
average of Vm under anaesthesia and during wakefulness. The spike 
threshold (the peak of the second derivative of Vm) was aligned at 0 mV. 
a–f, mean ± s.e.m. 


the initial 100 ms of the response, the ratio of inhibition (ΔGi) to 
excitation (ΔGe) was 1.4–2.9 (interval defined by the geometric 
s.e.m. around the geometric mean, see Methods), which is significantly 
larger than the ratio measured under anaesthesia (0.7–1.0) 
(P < 0.05, one-tailed two-sample t-test; Supplementary Fig. 9). 
Inhibition remained above baseline significantly longer than did 
excitation (by 29 ± 13 ms; P < 0.001), but both excitation and 
inhibition disengaged within 200 ms of the stimulus, mirroring the 
rapid termination of awake Vm and LFP responses observed earlier 
Figure 4 | Visually evoked conductance is dominated by inhibition in awake 
V1. a, b, ΔGe (red) and ΔGi (blue) evoked by centre stimulation during 
anaesthesia (n = 5) and waking (n = 8). c, d, ΔGe and ΔGi evoked by surround 
stimulation during anaesthesia (c) and waking (d) (for the same neurons as in 
a and b). e, Spatial profiles of excitation and inhibition under anaesthesia (left) 
and wakefulness (right). ΔGe and ΔGi were normalized to peak ΔGe for centre 
stimuli (grey dashed line) for each neuron and then averaged across the 
population. Data were fitted with a Gaussian function (mean ± s.d., shaded) or 
a linear function (for the awake Gi; mean ± s.d.). Scale bar, 18° width across the 
receptive field centre. a–d, mean ± s.e.m. 


(compare with Figs 1 and 2). Wakefulness also reduced the amplitude 
of ΔGe, presumably because the intracortical sources of excitation 
were themselves subject to the same enhanced inhibition as the 
recorded neuron. 

Strikingly, in awake animals, visually evoked inhibition was strongly 
activated even by stimuli that were placed far from the receptive field 
centre (Fig. 4d and Supplementary Fig. 9). During anaesthesia, by 
contrast, placing stimuli in these surrounding regions evoked little 
inhibition (Fig. 4c). Across all recordings in anaesthetized mice 
(n = 6), ΔGi over the first 100 ms of the response was 1.8-2.4-fold 
larger for stimuli in the centre than in the surround (ΔGi centre/surround 
ratio significantly >1; P < 0.005, one-tailed t-test). In awake 
mice, ΔGi was remarkably unselective for position: it was not significantly 
different in amplitude regardless of whether it was elicited from 
the centre or from the surrounding regions (P = 0.07). In fact, in every 
neuron recorded during wakefulness (n = 16; Supplementary Fig. 9), 
the ΔGi to ΔGe ratio for surround stimulation was greater than 1; for all 
recorded neurons, the ΔGi/ΔGe ratio evoked by surround stimuli was 
0.8-1.1 under anaesthesia and a much larger 2.7-3.5 during wakefulness 
(P < 0.01). 

Although retinotopy is the primary determinant of sensory responses 
in the visual cortex, synaptic conductances are also known to 
depend on stimulus orientation. We asked whether changing the 
stimulus orientation would have an effect on the observations. The 
results were similar to those described above: regardless of the stimulus 
orientation during wakefulness, Vm responses were brief, and inhibition 
dominated across visual space (Supplementary Figs 10 and 11). 
Taken together, these data identify a novel characteristic of awake 
cortical processing—elevated and spatially extended inhibition—that 
is associated with sensory responses that are more spatiotemporally 
selective (Fig. 4e). Previous measurements of sensory responses in 
anaesthetized animals have led to debate about the role of inhibition; 
our findings show that inhibition is a decisive 
factor in the awake cortex: it dominates excitation in amplitude and 
over time (Fig. 4b, d) and is evoked from regions of visual space that 
extend far beyond the central regions of the receptive field (Fig. 4e). 
This finding of increased inhibition during wakefulness is consistent 
with earlier suggestions and indicates a regime of sensory processing 
that cannot be observed during anaesthesia or sleep, in which 
more-balanced excitation and inhibition are evoked from large regions 
of space and persist long after the stimulus has disappeared. The 
increased inhibition in the awake cortex is ideally poised to extinguish 
any spatial or temporal spread of feedforward activity that is elicited 
by a sensory input. Accordingly, during wakefulness, we observed 
a brisk and highly selective impulse response to spatially localized 
visual stimuli. 

At present, it is unclear which factors regulate the strength of inhibition 
in the awake cortex. Neuromodulators can desynchronize LFP 
and Vm, depolarize interneurons, and alter response reliability 
and sensory perception. It will be important to examine such 
factors in this context, including the contributions of laminar connectivity 
and interneuron subtypes to the increased inhibitory conductances 
that we observed during wakefulness. Having identified 
inhibition as a major determinant of spatially selective and temporally 
succinct visual responses in the awake cortex, we suggest that beha- 
vioural factors such as attention and reward may also exert their influ- 
ence by modulating inhibition. 


METHODS SUMMARY 


All recordings were performed in layer 2/3 of monocular V1 (0.5 mm anterior 
and 2.0 mm lateral from lambda) in female C57BL/6J mice (4-6 weeks of age). 
Anaesthesia was induced with 10° mg chlorprothixene per kg body weight and 
1.5 mg urethane per kg body weight (Figs 1-4) or with 10-° mg chlorprothixene 
per kg body weight and 0.25-1% isoflurane in O2. Awake mice were habituated to 
head fixation over 4-5 days. The LFP was recorded with pipettes filled with 
HEPES-buffered artificial cerebrospinal fluid (which consisted of 135 mM NaCl, 
5.4 mM KCl, 5 mM HEPES, 1 mM MgCl2 and 1.8 mM CaCl2; pH 7.3). Patch 
pipettes (4-7 MΩ) were filled with standard internal solution (135 mM potassium 
gluconate, 6 mM KCl, 10 mM HEPES, 4 mM MgATP, 0.3 mM Na2ATP, 0.1 mM 
EGTA and 8 mM phosphocreatine; pH adjusted to 7.3; 290-295 mOsM) for 
current-clamp recordings. For voltage-clamp recordings, a 140 mM 
caesium-methanesulphonate-based solution also included 0.5 mM QX-314 and 5 mM 
tetraethylammonium (TEA). Vm was not corrected for the junction potential. 
The series resistance was compensated online and was monitored throughout 
voltage-clamp recordings (anaesthetized mice, 15 ± 2 MΩ; and awake mice, 
16 ± 2 MΩ). Conductance was calculated using the instantaneous current-voltage 
relationship at two holding potentials (near −80 and +20 mV), relative to baseline, 
as previously described. Vertically oriented black and white bars (9° wide, 
100% contrast, 100 ms duration and 1.5 s interstimulus interval) were presented 
monocularly, one at a time in randomly chosen positions (±45°). Spatial selectivity 
was defined as (R_centre − R_surround)/(R_centre + R_surround), where R is the peak 
Vm (or number of spikes per trial) averaged across the centre or surround 
locations. Mean ± s.e.m. is reported throughout, unless noted. For ratios, we used 
the geometric mean and geometric s.e.m., defined as exp{mean[log(ratios)]} and 
exp{s.e.m.[log(ratios)]}. Statistical testing (α = 0.05) was carried out using 
Wilcoxon signed-rank tests (paired data), rank-sum tests (unpaired data) and sign 
tests (difference from unity), unless noted. 
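
The two summary statistics defined above can be sketched in a few lines of Python. This is only an illustration of the stated formulas, not the authors' MATLAB code. The function names are hypothetical, the s.e.m. is assumed to use the sample standard deviation, and the interval is assumed to run from the geometric mean divided by the geometric s.e.m. to the geometric mean multiplied by it.

```python
import math
import statistics


def spatial_selectivity(r_centre, r_surround):
    """Selectivity index (R_centre - R_surround) / (R_centre + R_surround)."""
    return (r_centre - r_surround) / (r_centre + r_surround)


def geometric_mean_sem(ratios):
    """Return exp{mean[log(ratios)]} and exp{s.e.m.[log(ratios)]}."""
    logs = [math.log(r) for r in ratios]
    mean_log = statistics.fmean(logs)
    # Assumption: s.e.m. = sample standard deviation / sqrt(n).
    sem_log = statistics.stdev(logs) / math.sqrt(len(logs))
    return math.exp(mean_log), math.exp(sem_log)


def geometric_interval(ratios):
    """Assumed interval: geometric mean divided and multiplied by the geometric s.e.m."""
    gm, gsem = geometric_mean_sem(ratios)
    return gm / gsem, gm * gsem
```

Working in the log domain keeps a ratio of 2 and a ratio of 0.5 equally far from unity, which is presumably why the ratios are summarized geometrically.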


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Female C57BL/6J mice aged 4-6 weeks were used in both anaesthetized and awake 
recordings, and all procedures were performed under license from the UK Home 
Office in accordance with the Animal (Scientific Procedures) Act 1986. 
Recordings under anaesthesia. Anaesthesia was induced with 10° ° mg chlor- 
prothixene per kg body weight and 1.5 mg urethane per kg body weight (10% w/v 
dissolved in lactated Ringer’s solution, administered intraperitoneally) (for all data 
presented in Figs 1-4) or with 10° mg chlorprothixene per kg body weight and 
0.25-1% isoflurane in O2. In addition, 0.07 mg atropine sulphate per kg body 
weight was injected subcutaneously to maintain clear airways; 2 mg dexametha- 
sone per kg body weight was administered intramuscularly to prevent oedema; 
and 4 mg Rimadyl per kg body weight was administered subcutaneously as an 
analgesic and anti-inflammatory. Body temperature was maintained at 37.1 °C 
with a feedback-regulated heating pad. A tracheotomy was performed (during 
urethane experiments), and animals were passively ventilated with pure O2. The 
stimulated eye was protected with a gas-permeable contact lens. The unstimulated 
eye was gently sutured shut. A small head plate with a chamber was implanted over 
monocular regions of the visual cortex (0.5 mm anterior and 2 mm lateral from the 
lambda suture). Two small (<0.5 mm) craniotomies were drilled (within 200-400 µm 
of each other) for the separate entry of LFP and patch pipettes. A third 
craniotomy was drilled over the anterior retrosplenial cortex for the insertion of an 
Ag/AgCl reference wire. The dura was carefully removed immediately before 
insertion of whole-cell pipettes. Warm HEPES-buffered artificial cerebrospinal 
fluid (aCSF, 135 mM NaCl, 5.4 mM KCl, 5 mM HEPES, 1 mM MgCl2 and 
1.8 mM CaCl2; pH 7.3) filled the chamber to prevent desiccation and maintain 
ionic balance. 

Recordings during wakefulness. Awake mice were implanted with a lightweight 
head plate with a chamber under isoflurane anaesthesia and were allowed to 
recover for 2 days. On the third day, animals were acclimated to head fixation 
for 10-60 min, with a sweet-liquid reward given at the start and end of training. 
The duration of training was increased over 4-5 days until mice were sitting 
comfortably for 3h. On the day of recording, craniotomies, as described above, 
were performed under isoflurane anaesthesia. The animal was allowed to recover 
for 3h, and a single 3h recording session commenced. The dura was opened 
immediately before recording. Mice showed frequent grooming, whisking and 
postural adjustments during experiments. They were also given a sweet-liquid 
reward every 30 min to keep them comfortable and hydrated. 

Recordings. Patch pipettes (4-7 MΩ) were pulled on a PC-10 pipette puller 
(Narishige) and filled with internal solution containing 135 mM potassium 
gluconate, 6 mM KCl, 10 mM HEPES, 4 mM MgATP, 0.3 mM Na2ATP, 0.1 mM EGTA 
and 8 mM phosphocreatine (pH adjusted to 7.3 with KOH, 290-295 mOsM). 
For voltage-clamp recordings, potassium gluconate was substituted with 140 mM 
caesium methanesulphonate. QX-314 (0.5 mM) and tetraethylammonium (TEA) 
(5 mM) were included to block voltage-gated Na+ and K+ conductances. While 
searching for neurons, the diffusion of blockers was limited by using low pressures 
(20-30 mbar); more importantly, the LFP electrode served as an internal control 
for visual responsiveness. LFP response amplitudes and tuning were unaltered 
after using pipettes containing channel blockers. 

Whole-cell recordings were obtained with a MultiClamp 700B amplifier in 
voltage-clamp mode using standard techniques. Pipette capacitance was 
neutralized before break-in. The average seal resistances for anaesthetized and awake 
voltage-clamp recordings were 3.8 ± 1.7 GΩ and 6.3 ± 1.3 GΩ, respectively. The 
average seal resistances for anaesthetized and awake current-clamp recordings 
were 4.5 ± 1.2 GΩ and 5.3 ± 1.1 GΩ, respectively. The initial access resistance 
ranged from 9 MΩ to 30 MΩ. This value generally increased within a few minutes 
and then stabilized. If the initial access was greater than 50 MΩ, the recording was 
not included for further analysis. Access resistance during visual stimulation did 
not differ between anaesthetized and awake recordings in current-clamp 
experiments (anaesthetized mice, 36 ± 6 MΩ; and awake mice, 39 ± 3 MΩ) or 
voltage-clamp experiments (anaesthetized mice, 38 ± 5 MΩ; and awake mice, 38 ± 4 MΩ). 
Access resistance was compensated online during voltage-clamp recordings and 
was optimized manually. Series compensation averaged 63 ± 1%, yielding an 
effective series resistance of 15 ± 2 MΩ and 16 ± 2 MΩ during anaesthetized 
and awake voltage-clamp recordings, respectively. The junction potential was 
not corrected. The voltage division through the uncompensated series resistance 
was corrected offline. 

This method limits the distortion of the somatically recorded current by 
ongoing voltage fluctuations and membrane capacitance; however, because of 
the necessity for intrinsic current blockade, it also masks potential contributions 
from active dendritic conductances. This method also requires multiple trials 
and holding potentials; it thus cannot be used to simultaneously assess the 
relationship between excitation and inhibition within single trials. 
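
To make the two-potential approach concrete, the sketch below estimates the evoked conductance and its reversal potential from the baseline-subtracted currents at two holding potentials, then splits the total into excitatory and inhibitory parts. It is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the synaptic current is assumed to be linear in voltage, and the reversal potentials for excitation (0 mV) and inhibition (−80 mV) are illustrative values that the text does not specify.

```python
def conductance_from_two_potentials(v1, i1, v2, i2):
    """Slope and zero-crossing of the instantaneous I-V line through two points.

    v1, v2: holding potentials (mV); i1, i2: evoked currents relative to
    baseline (pA). Returns (delta_g in nS, v_rev in mV).
    """
    if v1 == v2:
        raise ValueError("holding potentials must differ")
    delta_g = (i2 - i1) / (v2 - v1)  # pA / mV = nS
    if delta_g == 0:
        raise ValueError("no evoked conductance; reversal potential undefined")
    v_rev = v1 - i1 / delta_g  # voltage at which the evoked current is zero
    return delta_g, v_rev


def split_conductance(delta_g, v_rev, e_exc=0.0, e_inh=-80.0):
    """Split total conductance into excitatory and inhibitory components.

    e_exc and e_inh are assumed reversal potentials (hypothetical defaults).
    """
    span = e_exc - e_inh
    delta_g_e = delta_g * (v_rev - e_inh) / span
    delta_g_i = delta_g * (e_exc - v_rev) / span
    return delta_g_e, delta_g_i
```

In practice the calculation would be repeated at every time point of the trial-averaged current traces, which is why the method needs multiple trials at each holding potential.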
As in previous studies, to better nullify excitatory currents, we used a holding 
potential of about +20 mV rather than 0 mV, as this mitigates the voltage decay 
across the dendrites. Current-clamp recordings had 0 current injection during 
stimulation protocols. Input resistance and series resistance were monitored 
between protocols with current pulse trains or voltage steps. Firing rate adaptation to 
supra-threshold pulses and broad spike widths confirmed that our recordings were 
from regular-spiking pyramidal neurons. Across all anaesthetized (n = 22; 
−305 ± 27 µm) and awake (n = 30; −279 ± 27 µm) whole-cell recordings, there 
was no significant difference in laminar depth (P = 0.94), as estimated from the 
micromanipulator reading after it was zeroed upon contact with the cortical surface. 
Acquisition, visual stimulation and analysis. All analyses and acquisitions were 
performed in MATLAB. Vm was low-pass filtered at 20 kHz. During voltage-clamp 
experiments, membrane current (Im) was low-pass filtered at 2 kHz. LFP 
was filtered from 0.1-100 Hz. Data were acquired using a National Instruments 
board and were synchronized to stimulus onset with a photodiode signal. 

The liquid crystal display monitor was positioned 24 cm from the mouse and at 
0° elevation and 30-45° azimuth. The azimuthal position was adjusted so that the 
centre of the LFP receptive field was roughly centred on the screen. Stimuli were 
presented on a grey background and randomly interleaved across space, with 8-20 
repetitions per stimulus location. A blank screen (the grey background) was ran- 
domly presented every 11 stimulus presentations for the same duration as the 
stimulus trials. Stimuli were delivered monocularly. The unstimulated eye was 
gently sutured (anaesthetized mice) or shielded by black aluminium foil (awake 
mice). Throughout the entire recording session, the monitor was continuously illu- 
minated with the same grey screen. This was to ensure that the spontaneous cortical 
state was not affected by transitions to and from complete darkness of the monitor. 

In the awake recordings, on establishing a stable whole-cell configuration, 
1-2 min of spontaneous activity was recorded before the presentation of any 
flashed bars. These data were used for all of the calculations of spontaneous activity 
(Figs 1a, b and 2a, b and Supplementary Figs 1 and 4); there were no systematic 
differences between the interleaved blank data and spontaneous data recorded 
before any visual stimulation. In some awake protocols, we shortened the inter- 
stimulus interval to 300 ms because the evoked responses returned to baseline well 
before this interval. There was no difference between the longer (1.5 s) and shorter 
interstimulus interval (n = 3) in terms of the magnitude or duration of evoked 
conductances. For measurements of Vm (Figs 2 and 3 and Supplementary Fig. 3), 
any spikes were median filtered (7 ms), and the resultant trace was smoothed. 
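
A sliding median is a common way to remove brief spikes while leaving the slower subthreshold Vm intact. The sketch below shows the idea in plain Python. It is an illustration only: the function name is hypothetical, the window is given in samples (converting 7 ms to samples depends on the sampling rate, which is not assumed here), and the window is simply truncated at the edges of the trace.

```python
import statistics


def median_filter(trace, window):
    """Replace each sample with the median of a centred window of samples."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number of samples")
    half = window // 2
    n = len(trace)
    filtered = []
    for i in range(n):
        lo = max(0, i - half)        # truncate the window at the start
        hi = min(n, i + half + 1)    # truncate the window at the end
        filtered.append(statistics.median(trace[lo:hi]))
    return filtered
```

A spike is removed only if it occupies fewer than half of the samples in the window, so the window has to be longer than twice the spike width.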

In a subset of experiments, we optimized the orientation of the flashed bars for 
each individual neuron (Supplementary Fig. 10) by first presenting full-screen 
drifting gratings (50% contrast, spatial frequency of 0.03 cycles per degree, tem- 
poral frequency of 2 Hz and 2 s duration) that varied randomly in orientation (30 
degree steps). The preferred orientation was designated as the orientation that 
evoked the largest number of spikes (or the largest depolarization of Vm), whereas 
the orthogonal orientation was 90° away from the preferred orientation. Bars that 
were identical to those in all other experiments (9° width, 100 ms duration, and 
1.5 s or 0.3 s interstimulus interval) were then presented randomly across space 
along the two axes defined by the preferred and orthogonal orientations. 

To obtain robust estimates of reversal potential (Vrev) and ΔG during the 
responses, we considered the response onset as the time when ΔG was 5-10% 
above baseline for 10 ms consecutively. Across the population of neurons, this time 
point occurred 30-40 ms after stimulus onset, so Vrev, ΔG, ΔGe and ΔGi were 
averaged starting 40 ms after stimulus onset, across the population (Fig. 4; n = 5 
urethane anaesthesia; n = 8 awake; both groups were tested identically with 
vertically oriented bars). Additional recordings of conductances under isoflurane 
anaesthesia (n = 1) and in awake mice that were presented with bars varying in 
orientation (n = 4) were not included in these plots to maintain equivalent group 
comparisons (but see Supplementary Fig. 9). 
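
The onset criterion (ΔG above a threshold for a sustained run of samples) can be sketched as follows. This is a hedged illustration rather than the authors' code: the function name is hypothetical, and the caller is assumed to supply both the threshold (for example, a chosen fraction of the peak ΔG, since the text does not state the reference level) and the number of consecutive samples that corresponds to 10 ms at the sampling rate in use.

```python
def response_onset(delta_g, threshold, min_samples):
    """Index of the first sample that starts a run of at least min_samples
    consecutive values above threshold, or None if no such run exists."""
    run = 0
    for i, value in enumerate(delta_g):
        run = run + 1 if value > threshold else 0
        if run >= min_samples:
            return i - min_samples + 1
    return None
```

Requiring a consecutive run guards against single noisy samples that briefly cross the threshold before the response has really begun.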

The state dependence of visual responses during anaesthesia (Fig. 2) was 
analysed by sorting trials (within neurons) by the pre-stimulus Vm level. The 
upper- and lower-most quartiles (that is, the 25% most depolarized and 25% most 
hyperpolarized non-overlapping trials) were then averaged across the population, 
and these largely correspond to trials in which the stimuli were delivered in the up 
state and the down state, respectively. Blank stimuli were also sorted in this 
manner to estimate the spontaneous state transitions from down to up, and vice 
versa, in the absence of sensory stimulation. 
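
The quartile split described above amounts to ranking trials by pre-stimulus Vm and keeping the two extremes. A minimal sketch follows, with a hypothetical function name and the assumption that the quartile size is rounded down when the trial count is not divisible by four.

```python
def split_extreme_quartiles(pre_stim_vm):
    """Return (lower, upper): trial indices of the 25% most hyperpolarized
    and the 25% most depolarized trials, ranked by pre-stimulus Vm."""
    order = sorted(range(len(pre_stim_vm)), key=lambda i: pre_stim_vm[i])
    k = len(order) // 4  # assumption: round the quartile size down
    lower = order[:k]
    upper = order[len(order) - k:]
    return lower, upper
```

The returned indices can then be used to average the corresponding response traces, with the lower group approximating down-state trials and the upper group approximating up-state trials.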
Scaling of embryonic patterning based on phase-gradient encoding 


Volker M. Lauschke1*, Charisios D. Tsiairis1*, Paul François2 & Alexander Aulehla1 


A fundamental feature of embryonic patterning is the ability to 
scale and maintain stable proportions despite changes in overall size, 
for instance during growth. A notable example occurs during 
vertebrate segment formation: after experimental reduction of 
embryo size, segments form proportionally smaller, and consequently, 
a normal number of segments is formed. Despite 
decades of experimental and theoretical work, the underlying 
mechanism remains unknown. More recently, ultradian oscillations 
in gene activity have been linked to the temporal control of 
segmentation; however, their implication in scaling remains elusive. 
Here we show that scaling of gene oscillation dynamics underlies 
segment scaling. To this end, we develop a new experimental 
model, an ex vivo primary cell culture assay that recapitulates mouse 
mesoderm patterning and segment scaling, in a quasi-monolayer of 
presomitic mesoderm cells (hereafter termed monolayer PSM or 
mPSM). Combined with real-time imaging of gene activity, this 
enabled us to quantify the gradual shift in the oscillation phase 
and thus determine the resulting phase gradient across the mPSM. 
Crucially, we show that this phase gradient scales by maintaining a 
fixed amplitude across mPSM of different lengths. We identify the 
slope of this phase gradient as a single predictive parameter for 
segment size, which functions in a size- and temperature-independ- 
ent manner, revealing a hitherto unrecognized mechanism for scal- 
ing. Notably, in contrast to molecular gradients, a phase gradient 
describes the distribution of a dynamical cellular state. Thus, our 
phase-gradient scaling findings reveal a new level of dynamic 
information-processing, and provide evidence for the concept of 
phase-gradient encoding during embryonic patterning and scaling. 

The sequential formation of body segments in vertebrates and arthropod 
species has been linked to oscillatory gene activity in segment 
precursor cells, termed presomitic mesoderm (PSM) in vertebrates. 
In this context, several signalling pathways, such as Notch, Wnt and 
Fgf, have been shown to exhibit oscillatory activity, with periods ranging 
from ~0.5-2.5 h (ref. 12). Oscillation dynamics can be visualized 
using in vivo real-time imaging methods (Fig. 1a-c and Supplementary 
Video 1), revealing highly coordinated activity patterns within the PSM. 
These are kinematic or phase waves, as they result from cell-autonomous 
oscillatory activity that is phase-shifted between neighbouring cells. 
Accordingly, physical boundaries do not block their progression 
(Supplementary Fig. 1 and ref. 17). Although oscillations have been 
implicated to function within a clock mechanism to control segmenta- 
tion temporally, the function of the phase shift per se in segment defi- 
nition has not yet been addressed, in part, owing to the challenge to 
perturb and analyse phase shifts experimentally at a quantitative level. 

We therefore developed an ex vivo mouse primary cell culture assay 
for mesoderm patterning and segment formation, allowing precise 
quantifications of oscillatory activities in a simplified, two-dimensional 
system that, in addition, enables us to study segmentation at differing 
spatial scales. 

Using real-time imaging of a dynamic Notch-signalling reporter, 
LuVeLu (Fig. 1d-g and Supplementary Videos 2 and 3), we found 
that in this assay, cultured mesoderm cells exhibit robust and long-lasting 
gene activity oscillations (n ~ 12-15 oscillations, Fig. 1d-f). 
These are equivalent in periodicity to oscillations detected in intact 
PSM (Fig. 1g). Moreover, we identified highly coordinated, concen- 
tric waves of gene activity that sweep across the mPSM in a central- 
peripheral direction (Fig. 1d). Like their in vivo counterparts, these 
waves are kinematic in nature (Supplementary Fig. 2). 

We confirmed that key mesoderm patterning and differentiation 
events are recapitulated in the ex vivo cell culture assay at the molecular 
level (Fig. 2 and Supplementary Fig. 3). The highest activities of the 
Wnt- and Fgf-signalling pathways were detected centrally, on the basis 
of the expression of several direct transcriptional targets, such as T, 
Msgn1 and Dusp4. At the periphery, markers crucial for somite forma- 
tion and differentiation, such as Mesp2 and Uncx4.1 (also known as 
Uncx), become activated (Fig. 2c and Supplementary Fig. 3). Remark- 
ably, we found that at this location in the mPSM, morphological 
segment boundaries are formed (Fig. 2d, e). Similar to in vivo, these 
boundaries form where gene activity waves halt, coinciding with the 
expression of Mesp2 (ref. 19; Supplementary Videos 4-7). 

Importantly, we identified that segment scaling occurs after changes 
in mPSM length. Once the primary cell culture expands to its max- 
imum dimensions after ~20h of incubation, a progressive decrease 
in mPSM length is observed. This is due to continuing segmentation 
which, in the absence of substantial further growth, leads to a constant 
reduction of undifferentiated mesoderm. Notably, we found that as 
mPSM length decreases, proportionally smaller segments formed 
(Fig. 3a). A linear correlation between segment size and mPSM length 
was maintained (P<0.0001) over the full range of PSM lengths 
studied (~4-fold changes). In agreement with previous in vivo findings 
in vertebrate embryos, segment scaling in our ex vivo assay also 
occurred along the anterior—posterior axis, effectively reducing the 
width of segments. This demonstrates that segment scaling is recapi- 
tulated in our two-dimensional, ex vivo cell culture assay, establishing a 
new experimental model to study the underlying mechanism. To this 
end, we performed a detailed quantification of oscillations and phase- 
wave dynamics during the scaling process. 

First, we found that in the central mPSM the oscillation period 
remained constant, irrespective of PSM length (Supplementary Fig. 
4). We next analysed the velocity of kinematic waves, a read-out of the 
phase distribution and therefore of oscillation dynamics in the mPSM 
(see definition in Supplementary Fig. 5 and below). Surprisingly, we 
found that the velocities of kinematic waves change linearly with over- 
all mPSM length (P < 0.0001), and thus larger samples show propor- 
tionally faster kinematic waves than smaller samples (Fig. 3b). This 
indicated that oscillatory activity is modified to match precisely the 
spatial context in which it occurs. 

To extend this finding, we directly analysed oscillation phases 
within the mPSM. We used real-time measurements to calculate the 
phase (φ) independently for every position within the mPSM (Supplementary 
Fig. 5). In a second step, we determined the spatial phase 
differences, which allowed us to determine experimentally the phase 
Figure 1 | An ex vivo cell culture model for gene 
activity oscillations. a-c, Schematic 
representation of a mouse embryo at embryonic 
day (E)10.5 (a) illustrating the experimental set-up 
for in vivo fluorescence imaging (b, c) of LuVeLu 
activity. A posterior embryo fragment including 
the PSM was used. Dashed red line in a shows the 
level of the cut. d-f, Ex vivo cell culture assay, in 
which the tail bud mesoderm was isolated (solid 
blue line in a). For the time series of real-time 
imaging experiments (b, d), a subregion is 
magnified (shown at the top right) to exemplify the 
changes in intensity. The expression patterns at 
successive time points (times indicated in minutes) 
are projected into the last thumbnail. b, Note 
periodic activity waves, sweeping from posterior to 
anterior PSM. c, Kymograph along the PSM (arrow 
in first frame of b) showing spatial (y axis) and 
temporal (x axis) quantifications (intensity colour- 
coded), depicting regular oscillations and activity 
waves from posterior to anterior. d, For the ex vivo 
assay, real-time imaging reveals LuVeLu reporter 
activity waves that progress in a central-peripheral 
direction. e, The intensity within the magnified 
region in d is plotted over time. f, Kymograph 
based on quantification along the arrow depicted in 
d; regular pulses and waves in intensity are seen 
over the entire course of the ex vivo culture. 
g, Quantification of oscillation period in in vivo 
experiments and ex vivo cell culture assays shows 
that there is no significant difference (P = 0.64, 
n = 12 for in vivo PSM, n = 16 for ex vivo cultures). 
Error bars denote s.d.; NS, not significant. All scale 
bars, 100 µm. 


Figure 2 | Molecular and morphological analysis of the ex vivo cell culture 
model using in situ hybridization after 18-24 h of culture. a, b, The Wnt-target 
gene T (a) and the Fgf-target gene Dusp4 (b) are expressed centrally. 
c, Mesp2, indicative for the onset of mesoderm differentiation, is activated in the 
periphery of the cell culture assay. d, Bright-field image indicating segment 
formation. e, Magnification of region indicated by the red box in d, showing 
sharp boundaries between segments. S1-S3 denote segments in the order of 
formation. f, Scheme illustrating overall reorganization of the embryonic 
anterior-posterior axis in a central-peripheral direction. Wnt- and Fgf-target 
genes are upregulated centrally and downregulated peripherally, the inverse is 
true for the retinoic acid (RA) target gene Aldh1a2 (Supplementary Fig. 6). LPSM 
denotes length of PSM; Lseg denotes width of one segment. All scale bars, 100 µm. 
Figure 3 | Quantification of segment sizes and oscillation dynamics reveals 
scaling behaviour due to a fixed phase gradient amplitude. 
a-c, e-g, Experiments were performed at either 37 °C (a-c) or 33 °C 
(e-g). a, Scatter plot of segment widths and length of mPSM from which they 
form indicates a linear correlation (P < 0.0001, n = 26 segments; n = 5 
samples). b, The velocity of kinematic waves shows a linear correlation to the 
length of mPSM in which they occur (P < 0.0001, n = 42 waves; n = 7 
samples). As a result, the total time required for a kinematic wave to travel 
entirely through mPSM of different lengths (time of flight, tTOF) remains 
constant (tTOF = 128 min). c, Quantification of the phase-gradient amplitude 
within mPSM of different lengths: the total amplitude (y axis) is constant 
(2.04π ± 0.21π rad, mean ± s.d., n = 22), irrespective of variations in mPSM 
length (x axis). d, Scheme illustrating effect of fixed phase-gradient amplitude 
in samples of differing size: the total phase span remains constant despite 
changes in total length, and thus the phase distribution is proportionally 
adjusted. As a consequence, proportional segments contain the same phase 
span. e, At 33 °C, segment widths show a linear correlation with mPSM length 
and the regression coefficient is indistinguishable from the correlation found at 
37 °C (dashed line; P = 0.509, n = 13 segments; n = 4 samples). f, The 
regression coefficient between kinematic wave velocities and mPSM length is 
significantly altered after a temperature shift from 37 °C to 33 °C (P < 0.001, 
n = 16 waves; n = 3 samples). g, Even at 33 °C, the phase-gradient amplitude 
is constant (2.08π ± 0.17π rad, n = 10), and indistinguishable from that at 
37 °C (P = 0.45). 


gradient in the mPSM (Supplementary Fig. 5). Measuring the phase 
difference between the most posterior and most anterior oscillating 
mPSM positions yields the total phase-span present in the mPSM, 
equivalent to the phase-gradient amplitude. Importantly, we found 
this amplitude to be constant, close to 2π rad, irrespective of mPSM 
lengths (Fig. 3c). Having a constant gradient amplitude means that the 
slope of the phase-gradient (∂φ/∂x, in which x denotes length), the 
phase difference between mPSM cells, is inversely proportional to 
mPSM length and segment size. In other words, larger samples show 
a shallower phase gradient and thus smaller phase differences between 
mPSM cells than small samples (Fig. 3d). Such a gradient behaviour 
can fully account for scaling, as suggested previously. Accordingly, 
we find that independent of the mPSM length, a newly formed segment 
spans ~21% of the mPSM phase gradient (Fig. 3a, d). 
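
The scaling argument can be checked with a short calculation. The sketch below assumes, purely for illustration, that the phase gradient is linear across the mPSM, so that the slope is 2π divided by the mPSM length. The function names and the example lengths are hypothetical; the 2π amplitude and the ~21% phase span per segment are the values reported above.

```python
import math

PHASE_AMPLITUDE = 2 * math.pi      # total phase span across the mPSM (rad)
SEGMENT_PHASE_FRACTION = 0.21      # share of the phase span per new segment


def phase_gradient_slope(mpsm_length):
    """Slope of an assumed linear phase gradient (rad per unit length)."""
    return PHASE_AMPLITUDE / mpsm_length


def segment_size(mpsm_length):
    """Length over which a fixed phase span is accumulated at this slope."""
    segment_phase = SEGMENT_PHASE_FRACTION * PHASE_AMPLITUDE
    return segment_phase / phase_gradient_slope(mpsm_length)


if __name__ == "__main__":
    for length in (400.0, 800.0, 1600.0):  # hypothetical mPSM lengths
        print(length, phase_gradient_slope(length), segment_size(length))
```

Under these assumptions the segment size is a fixed fraction of the mPSM length, which is the proportional scaling the measurements show.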

Our results identify two predictive parameters, the phase-wave 
velocity (v) and the slope of the phase-gradient (∂φ/∂x), that scale 
to mPSM length and segment size (Fig. 3b, c). These parameters are 
interconnected by the following relationship: v = (∂φ/∂t)/(∂φ/∂x), in 
which ∂φ/∂t is the oscillation frequency. To dissect the part that each 
parameter plays during the scaling process, we performed temperature-shift 
assays. In zebrafish it has been previously shown that a change in 
temperature influences the rate of segmentation without, however, altering 
segment sizes. We found a similar result in the mouse mPSM ex 
vivo cell culture model; segmentation proceeded more slowly at 33 °C, 
yet segment sizes remained unchanged and followed the identical 
correlation to mPSM length as at 37 °C (Fig. 3e). To test which 
of the two predictive parameters we identified, v and ∂φ/∂x, can 
account for temperature-invariant segment sizes, we quantified oscillation 
dynamics in the temperature-shift assay. First, we found 
that the overall oscillation period (T) was altered by temperature 
(T37°C = 137.4 min ± 7.3 s.d., n = 16; T33°C = 193.4 min ± 14.4 s.d., 
n = 3). In addition, we found that although the phase-wave velocity 
(v) still correlated with mPSM size (Fig. 3f), overall, v was significantly 
decreased at lower temperature (Fig. 3f). Therefore, phase-wave velocity 
(v) cannot individually account for temperature-invariant segment size 
definition. 
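
The relationship between velocity, frequency and slope explains why the velocity changes with temperature while the slope need not. The sketch below is an illustration under assumptions rather than a reanalysis: the function name is hypothetical, the oscillation frequency is taken as 2π divided by the period, and the slope is computed for a hypothetical 1,000 µm mPSM with a 2π rad phase span. The two periods are the mean values reported above.

```python
import math


def phase_wave_velocity(period_min, phase_slope):
    """v = (dphi/dt) / (dphi/dx), with dphi/dt = 2*pi / period."""
    angular_frequency = 2 * math.pi / period_min  # rad per min
    return angular_frequency / phase_slope        # length units per min


if __name__ == "__main__":
    slope = 2 * math.pi / 1000.0  # rad per µm, hypothetical 1,000 µm mPSM
    for label, period in (("37 °C", 137.4), ("33 °C", 193.4)):
        print(label, phase_wave_velocity(period, slope), "µm per min")
```

With the slope held fixed, a longer period at 33 °C gives a proportionally slower wave, so the velocity tracks temperature whereas the slope can remain a temperature-independent predictor of segment size.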

By contrast, we found that the phase-gradient amplitude remains 
constant at 2π rad in samples grown at 33 °C (Fig. 3g). This indicates 
that the phase-gradient slope (∂φ/∂x) is predictive for segment sizes in 
a temperature-independent manner. 

Our findings thus provide strong evidence that a crucial parameter 
for segment size definition is encoded at the level of phase differences 
between PSM cells, supporting the general concept that temporal 
order, that is, a phase gradient, can provide spatial information for 
embryonic patterning. 

In principle, the mechanism underlying phase-gradient scaling 
could rely on sensing global mPSM length, for instance, by integrating 
signals from opposing signalling gradients. Indeed, such opposing 
signalling gradients have been previously identified within the PSM. 
To test this possibility, we analysed our ex vivo cell culture assay during 
the initial culture period (<20h of culture). Using both morphological 
and molecular data, we found that at these early time points, only 
an incomplete mPSM showing uniquely a posterior, undifferentiated 
mPSM identity, was present (Supplementary Fig. 6). Accordingly, 
anterior mPSM molecular markers, such as Mesp2 and Aldh1a2, are 
not yet expressed. Moreover, all cells retain oscillatory activity and 
segments do not form during the first 20 h of culture. Analysis of 
the phase-gradient amplitude within the incomplete mPSM showed 
that it measured clearly less than 2π rad (Supplementary Fig. 7).
Importantly, even in the incomplete mPSM and thus in absence of 
an anterior, opposing gradient, we identified clear evidence for phase- 
gradient scaling (Fig. 4). Thus, we found that the phase-gradient slope 
changed exponentially throughout the culture, irrespective of whether 
a complete mPSM was present or not (Fig. 4b). This suggests the 
presence of a single underlying mechanism that functions indepen- 
dently of opposing mPSM-signalling gradients. 
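
An exponential trend of this kind can be checked with a log-linear least-squares fit. The following is a minimal sketch, assuming the inverse phase-gradient slope has already been measured once per oscillation cycle; the function name and the synthetic values are illustrative only and are not taken from the data.

```python
import math

def fit_exponential(cycles, values):
    """Fit values = a * exp(k * cycle) by least squares on log(values).

    Returns (a, k, r_squared), where r_squared is computed in log space.
    """
    logs = [math.log(v) for v in values]
    n = len(cycles)
    mean_x = sum(cycles) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in cycles)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(cycles, logs))
    k = sxy / sxx
    log_a = mean_y - k * mean_x
    ss_res = sum((y - (log_a + k * x)) ** 2 for x, y in zip(cycles, logs))
    ss_tot = sum((y - mean_y) ** 2 for y in logs)
    return math.exp(log_a), k, 1.0 - ss_res / ss_tot

# Synthetic example: inverse slope (um per rad) decaying exponentially per cycle.
cycles = list(range(-5, 6))
inverse_slope = [150.0 * math.exp(-0.1 * c) for c in cycles]
a, k, r2 = fit_exponential(cycles, inverse_slope)
print(f"a = {a:.1f}, k = {k:.3f}, R^2 = {r2:.3f}")
```

A single rate constant k that describes both the negative (truncated mPSM) and positive (complete mPSM) cycle numbers would correspond to the uniform behaviour described above.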

We then analysed phase-wave velocities within the incomplete 
mPSM. To this end, we determined the virtual mPSM lengths at which 
a phase-gradient amplitude of 2π rad, the value that we found to be
characteristic for a complete mPSM, is reached (Supplementary Fig. 7 
and Fig. 4a). We found that phase-wave velocities are linearly propor- 
tional to these projected mPSM lengths (Fig. 4a, inset). Crucially, the 
correlation is nearly identical to the one found at later time points, 
when velocities can be compared to de facto, complete mPSM lengths 
(Fig. 4a, inset). 
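
The projection step can be sketched as a linear extrapolation. The example below assumes that the truncated phase profile is close to linear in space (as reported in the Methods for samples grown at 37 °C) and returns the length over which the fitted gradient would span 2π rad. The function name and the synthetic profile are hypothetical and serve only to illustrate the idea.

```python
import math

def projected_length(positions_um, phases_rad, target_amplitude=2 * math.pi):
    """Extrapolate a truncated, roughly linear phase gradient.

    Fits phase = intercept + slope * position by least squares and returns
    the distance over which the fitted gradient would span target_amplitude.
    """
    n = len(positions_um)
    mean_x = sum(positions_um) / n
    mean_p = sum(phases_rad) / n
    sxx = sum((x - mean_x) ** 2 for x in positions_um)
    sxp = sum((x - mean_x) * (p - mean_p)
              for x, p in zip(positions_um, phases_rad))
    slope = sxp / sxx
    return target_amplitude / abs(slope)

# Synthetic truncated gradient: true slope 2*pi/800 rad/um, sampled only to 500 um.
xs = [float(x) for x in range(0, 501, 50)]
phis = [2 * math.pi / 800.0 * x for x in xs]
print(f"projected mPSM length: {projected_length(xs, phis):.1f} um")
```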

Such a scaling behaviour is markedly different from other examples 
of scaling, as it does not rely on measuring global mPSM size and

Figure 4 | Phase-gradient scaling does not rely on sensing global size cues.
a, b, Experimental measurements of phase gradients. c, d, Numerical 
simulations of the integrative kinematic scaling model (Supplementary 
Information). a, Truncated phase gradients (waves −5 to −1; see
Supplementary Fig. 7) are spatially projected to the point of their completion
(red squares); this defines the (projected) mPSM length at which the
phase-gradient amplitude measures 2π. Inset shows a scatter plot of early, truncated
phase-wave velocities against projected mPSM lengths, indicating a linear
correlation (P < 0.0001). The regression coefficient is not significantly different
(P = 0.867, n = 27 waves; n = 6 samples) from the one calculated at later time 
points, when actual mPSM lengths are measured (dashed line represents 

the regression line from Fig. 3b). b, Plot of the (inverse) phase-gradient slope (y 
axis) as a function of oscillations cycles (x axis) shows that the change in slope 
over time is a uniform function throughout the entire measurements, 
irrespective of whether a truncated mPSM (waves −6 to −1) or a complete


operates even when the total field to be scaled is not yet completely 
formed. 

Furthermore, these findings reflect the ability of mPSM to retain a 
developmental memory after isolation, as has been previously found in 
classical experimental embryology work (reviewed in ref. 25), and as 
has been suggested in theoretical models. Accordingly, segments
form after an initial delay of ~20h (Fig. 4) as the posterior mPSM 
fragments retain an undifferentiated identity. At the same time, how- 
ever, our approach reveals a marked plasticity within the posterior 
mPSM that enables considerable changes in oscillatory dynamics 
and ultimately, segment scaling to occur. 

The precise molecular circuits leading to segment scaling are as yet 
unknown. Previously, signalling gradients were functionally implicated
in controlling PSM differentiation; however, experimental
evidence for a role in directly controlling the oscillation dynamics is
still lacking. We confirmed that Wnt and Fgf gradients are present
throughout the ex vivo culture (Fig. 2 and Supplementary Fig. 8). 
However, in the absence of quantitative and, importantly, dynamic 
data, their precise function in this scaling process and the interaction 
with complex phase-gradient dynamics awaits further investigation. 

Here we used modelling to address the principles of segment scaling. 
We mathematically formalized our results and asked whether these are 
sufficient to recover scaling behaviour in silico. This model (see full 
derivation in Supplementary Information) is based on the following 
key findings. First, we identified that oscillation dynamics pivot around 
a fixed phase-gradient amplitude of ~2π rad. The observed maintenance
of a fixed phase shift Δφ between the posterior PSM and the

mPSM (waves +1 to +5) is present. An exponential trendline is overlaid in
black (R² = 0.99). Error bars indicate s.d. c, Kymograph corresponding to
simulation of equation (8) in the Supplementary Information, for two 
successive regimes (growth and no growth). Levels of grey indicate cosine of 
phase. The arrested phase serves as a proxy for pattern formation. Note that in 
this model, the size of the pattern is a fixed fraction of oscillating field size 
(~PSM). Thus, during PSM growth, patterns are stable (first five waves); once
growth ceases and PSM length is reduced, pattern size (~segment size) scales 
proportionally (last five waves). The growth regime is experimentally observed 
in vivo (Fig. 1b, c), whereas the no-growth regime is reflected experimentally in 
the ex vivo cell culture assay (a). A.U., arbitrary units. d, On the basis of 
numerical simulations, the slope of the phase gradient (plotted as the inverse, 
blue diamonds) was calculated for each consecutive oscillation cycle (x axis). 
Note the (inverse) correlation of the phase-gradient slope to corresponding 
segment size (red squares). 


anterior differentiation front, irrespective of PSM length, strongly 
argues for a tight interdependence between segmentation oscillator 
and differentiation front definition. This is in contrast to common 
models for segmentation, such as the clock and wave-front model,
in which oscillations and differentiation front are two interacting, but 
essentially independent entities. Second, our quantifications indicate 
that the phase gradient evolves nonlinearly over time, with an expo- 
nential function describing the system accurately (Fig. 4b). This means 
that oscillations are slowing down exponentially throughout the entire 
culture period (with the exception of the most posterior mPSM loca- 
tion; Supplementary Fig. 4). These findings were formalized using a 
kinematic phase equation. Taking further into account posterior
growth and coupling between cells in the PSM (Supplementary 
Information), the complete numerical simulation closely recapitulates 
the observed experimental data (Fig. 4c, d). This shows that this mini- 
mal set of principles can fully account for segment scaling. Thus, 
we propose that scaling is a direct consequence of nonlinear phase- 
gradient dynamics combined with a fixed gradient amplitude. 
Finally, our identification that the phase gradient is the single
known predictive parameter for both segment scaling and temperature
compensation has important conceptual implications. Phase differences
within the PSM are a defining feature of oscillations during
the segmentation process, and the mechanism of how these differences
are set up is being investigated. However, current models of
segmentation generally do not assign any clear functional relevance
to these phase differences per se. This work prompts a change in
perspective as our experimental findings support the concept that
spatial patterning is based on information encoded at the level of
oscillation phase differences between mPSM cells. 

In conclusion, we present a powerful ex vivo culture assay for meso- 
derm segmentation that enabled the identification of a conceptually 
distinct segment-scaling mechanism based on phase-gradient encoding. 


METHODS SUMMARY 


All parameter measurements were done on data from real-time imaging of the 
LuVeLu transgenic reporter line, which expresses a destabilized version of Venus
(yellow fluorescent protein) under the lunatic fringe reporter. For the ex vivo cell 
culture assay, tail bud mesoderm was removed from mouse embryos at embryonic 
day 10.5 and cultured in fibronectin-coated chamber slides under controlled 
environmental conditions on an inverted microscope (Zeiss LSM 780 NLO). In 
situ messenger RNA hybridizations and immunohistochemistry were performed 
directly in chamber slides. Image processing, analysis and parameter measure- 
ments were done using software packages ZEN (Zeiss), Fiji and MATLAB. 


Full Methods and any associated references are available in the online version of 
the paper.

METHODS
Ex vivo cell culture assay for mesoderm patterning. Chamber slides (Lab-Tek) 
were coated with 50 μg ml⁻¹ fibronectin (Sigma) in 100 mM NaCl for 4 h at room
temperature or overnight at 4 °C. After coating, the dishes were air-dried for 5 min
and washed with embryo culture medium (DMEM-F12 (Cell Culture Technologies)
plus 0.5 mM glucose, 2 mM glutamine and 1% BSA) for 20 min with slight
shaking. E10.5 embryos of LuVeLu × Cd1 crosses were dissected in HEPES-buffered
embryo culture medium (culture medium plus 10 mM HEPES). To isolate
tailbud mesoderm, a transverse cut was made at or behind the posterior neuropore.
Tailbuds were placed with this transverse cut facing down on the fibronectin-coated
dishes. Routine culture conditions were 37 °C, 5% CO2 and ambient O2
levels, or as previously described.
Two-photon microscopy. Imaging was performed with a Zeiss LSM780 
laser-scanning microscope. Samples were excited using a Ti:Sapphire Laser 
(Chameleon-Ultra, Coherent) at a wavelength of 960 nm through a ×20 plan
apo objective (numerical aperture 0.8). A Z-stack of either 6-8 planes (for in vivo
imaging) or 3-4 planes (for the ex vivo assay) at a 6-8 μm distance was scanned
every 10 min; up to 16 samples were recorded during an experiment using a
motorized stage. 
Further mouse strains and animal work. The Mesp2-GFP line was obtained by
the RIKEN BRC through the National Bio-Resource Project of the MEXT, Japan. 
Both the LuVeLu and the Mesp2-GFP line were maintained on a Cd1 background. 
All animal experiments were conducted under veterinarian supervision and rules 
of the European Molecular Biology Laboratory, following the guidelines of the 
European Commission, Directive 2010/63/EU and AVMA Guidelines 2007. 
In situ hybridization on primary culture PSM tissue. In situ hybridization and
probe generation were performed as described previously, with minor modifications
when applied to the primary ex vivo culture samples (that is, reduced incubation
and washing times). The entire hybridization and colour reactions were
performed in chamber slides. Probes were used as described previously or amplified
using the following primers: Meox1 forward 5'-TGAGATTGCAGTCAACCTGG-3',
reverse 5'-TTGGAGAACACAAGACGCTG-3'; Pitx2 forward 5'-TCAGAGTATGTTTTCCCCGC-3',
reverse 5'-CGCAAGCGAAAAATCCTAAC-3'; Cldn6 forward 5'-CAGAGCCCTCTGTGTTGTCA-3',
reverse 5'-AGAGGTGGAGCTTGGACTCA-3'; Sox2 forward 5'-ACCAGCTCGCAGACCTACAT-3',
reverse 5'-ACGAAAACGGTCTTGCCAGT-3'.
Immunofluorescence on primary culture PSM tissue. Ex vivo cell cultures were 
washed with PBS and fixed on ice for 1 min in 50% methanol, 50% dimethylsulphoxide
(DMSO). Afterwards, cells were treated with 15% H2O2, 50 mM NH4Cl in
PBS for 15 min on ice. Cells were washed three times with 1% Triton X-100 in PBS 
and blocked for 10 min with PBS plus 1% Triton X-100 and 10% FCS. Primary 
antibody incubations were carried out at 4°C overnight. Phosphorylated ERK1/2 
was detected with a 1:150 dilution of rabbit-anti-phospho-p42/44 MAPK (ERK1/2)
(Cell Signaling, D13.14.4); β-catenin was detected using a 1:500 dilution of
mouse-anti-β-catenin (BD Transduction, 610153). After incubation, the cells were washed
three times with PBS plus 1% Triton X-100 and 10% FCS, and subsequently incubated
with secondary antibody (1:250 of Fab2-goat-anti-rabbit-Alexa488; Cell
Signaling, 4412S; or 1:500 of Fab2-goat-anti-mouse-Alexa555; Cell Signaling,
4409S) at 4°C overnight. After secondary antibody incubation, cells were washed 
twice with PBS and 1% Triton X-100 plus 10% FCS, and incubated with 1:1,000 
4',6-diamidino-2-phenylindole (DAPI) in PBS plus 1% Triton X-100 for 1h at 
room temperature. Once nuclear staining was judged to be sufficient, cells were 
imaged on an LSM780 microscope (Zeiss). 

For temperature-shift experiments, samples were grown at 33 °C; only samples 
in which clear segment formation occurred were used for subsequent analysis. 


Image processing. All data were recorded in 12 bit, 512 × 512 pixels, 1.38 μm
pixel⁻¹; all z-planes were considered for maximum intensity projections done
in the ZEN software. Fiji was used for further image processing and the generation
of kymographs. To this end, maximum projection time series were blurred
using a Gaussian filter (7 pixel sigma radius). Kymographs were generated along a 
line from the oscillation centre to the periphery on the blurred videos, perpendicu- 
lar to the wavefront. For further analysis with MATLAB, the kymographs were 
blurred again (Gaussian, 7 pixel sigma radius) and saved as text image. 

The trend of the kymograph intensity was removed and the intensity was 
smoothened using a moving average function. The Hilbert transform was calculated
along the time coordinate for every space point. From the Hilbert transform
the instantaneous phase was extracted and plotted for every location and for all 
time points (Supplementary Fig. 5). 
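
The phase-extraction step can be illustrated with a short, self-contained sketch. It is not the MATLAB code used for the analysis: it detrends a single intensity trace with a straight-line fit (a simplifying assumption; the smoothing step is omitted), builds the analytic signal with a plain discrete Fourier transform, and reads off the instantaneous phase. In practice a library routine such as `scipy.signal.hilbert` would normally replace the hand-written transform. All function names and the synthetic trace are illustrative.

```python
import cmath
import math

def detrend_linear(signal):
    """Subtract the least-squares straight line from a 1D signal."""
    n = len(signal)
    mean_t = (n - 1) / 2
    mean_s = sum(signal) / n
    stt = sum((t - mean_t) ** 2 for t in range(n))
    sts = sum((t - mean_t) * (s - mean_s) for t, s in enumerate(signal))
    slope = sts / stt
    return [s - (mean_s + slope * (t - mean_t)) for t, s in enumerate(signal)]

def analytic_signal(signal):
    """Analytic signal via a plain DFT (O(n^2); adequate for short traces)."""
    n = len(signal)
    spectrum = [
        sum(s * cmath.exp(-2j * math.pi * k * t / n) for t, s in enumerate(signal))
        for k in range(n)
    ]
    # Keep DC (and Nyquist for even n), double positive frequencies,
    # and zero the negative frequencies.
    weights = [0.0] * n
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
    for k in range(1, (n + 1) // 2):
        weights[k] = 2.0
    return [
        sum(w * c * cmath.exp(2j * math.pi * k * t / n)
            for k, (w, c) in enumerate(zip(weights, spectrum))) / n
        for t in range(n)
    ]

def instantaneous_phase(signal):
    """Instantaneous phase in radians, in (-pi, pi], of a detrended trace."""
    return [cmath.phase(z) for z in analytic_signal(detrend_linear(signal))]

# Synthetic trace: drifting baseline plus an oscillation with a 14-sample period.
trace = [100 + 0.5 * t + 20 * math.cos(2 * math.pi * t / 14) for t in range(140)]
phase = instantaneous_phase(trace)
print(f"phase at sample 70: {phase[70]:.2f} rad")
```

Applying `instantaneous_phase` to the time series at every spatial position of a kymograph would give a phase kymograph of the kind described above.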

Images of in situ mRNA hybridizations were taken using a Leica MZ16F
stereomicroscope and a Leica DFC420C digital camera. Brightness and contrast were
nonlinearly adjusted uniformly to the entire image. 
Parameter measurements. Velocities of kinematic waves were measured on the 
basis of phase kymographs. For each wave, a line was manually placed over the 
calculated phase with a value of 2. The resulting speed was given by tan(θ) × time
unit per pixel/space unit per pixel. 

Segment widths were measured at defined time points using ZEN software 
based on identification of segment boundaries in the transmitted light image; three 
independent measurements were taken and averaged. 

mPSM length was measured as the distance between the point of origin of 
oscillations (or, in LuVeLu-negative explants, corresponding morphological land- 
marks), and the positions at which these oscillations halt. 

To calculate the phase gradient and the total phase shift (or amplitude) in the
mPSM (Δφ), phase kymographs were used to unwrap phases along the space
dimension for every time value. Then, Δφ = φmax − φmin for time points at which
wave troughs reach the end of the mPSM. The average Δφ for 37 °C was calculated
from 22 waves from seven samples, and for 33 °C from 10 waves from three
samples. 
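
The unwrapping and amplitude calculation can be sketched as follows. This is a minimal illustration that assumes the wrapped phase profile along space at one time point is available as a list of values and that neighbouring positions differ by less than π rad; the function names and the synthetic profile are hypothetical.

```python
import math

def unwrap(phases_rad):
    """Remove 2*pi jumps between neighbouring samples along one dimension."""
    unwrapped = [phases_rad[0]]
    for current, previous in zip(phases_rad[1:], phases_rad[:-1]):
        step = current - previous
        # Map the step into [-pi, pi) so that wrap-around jumps disappear.
        step = (step + math.pi) % (2 * math.pi) - math.pi
        unwrapped.append(unwrapped[-1] + step)
    return unwrapped

def phase_amplitude(phases_rad):
    """Total phase shift: max minus min of the unwrapped spatial profile."""
    profile = unwrap(phases_rad)
    return max(profile) - min(profile)

# Synthetic wrapped profile spanning 2*pi rad from posterior to anterior.
n_points = 50
true_profile = [2 * math.pi * i / (n_points - 1) for i in range(n_points)]
wrapped = [(p + math.pi) % (2 * math.pi) - math.pi for p in true_profile]
print(f"amplitude = {phase_amplitude(wrapped):.3f} rad")
```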

From phase kymographs the slopes Δφ/Δx (which in approximation corresponds
to the partial derivative ∂φ/∂x) were calculated. Phase values along space
were plotted for time points at which each wave’s trough reaches the anterior
mPSM end, as defined in phase kymographs. Correlation of phase (φ) as a function
of length (x) was close to linear for the samples grown at 37 °C, and the slope
(Δφ/Δx) was calculated by linear fitting.

The time of flight (tTOF) was determined using the regression line in Fig. 3b, tTOF
being the inverse of the slope. Periods for selected regions of interest were calculated
using the Lomb-Scargle algorithm applied on the changes of intensity over time.
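
For orientation, the classical Lomb-Scargle periodogram can be written in a few lines. The sketch below is a plain, unnormalized implementation evaluated on a grid of trial periods; it is not the routine used for the analysis. The synthetic trace assumes one intensity value every 10 min (the imaging interval stated above) and an arbitrary 137.4-min oscillation; the function names are illustrative.

```python
import math

def lomb_scargle_power(times, values, period):
    """Classical Lomb-Scargle power at one trial period (unnormalized)."""
    omega = 2 * math.pi / period
    mean = sum(values) / len(values)
    centred = [v - mean for v in values]
    # The time offset tau makes the sine and cosine terms orthogonal.
    tau = math.atan2(sum(math.sin(2 * omega * t) for t in times),
                     sum(math.cos(2 * omega * t) for t in times)) / (2 * omega)
    cos_terms = [math.cos(omega * (t - tau)) for t in times]
    sin_terms = [math.sin(omega * (t - tau)) for t in times]
    yc = sum(y * c for y, c in zip(centred, cos_terms))
    ys = sum(y * s for y, s in zip(centred, sin_terms))
    cc = sum(c * c for c in cos_terms)
    ss = sum(s * s for s in sin_terms)
    return 0.5 * (yc * yc / cc + ys * ys / ss)

def dominant_period(times, values, trial_periods):
    """Return the trial period with the highest Lomb-Scargle power."""
    return max(trial_periods, key=lambda p: lomb_scargle_power(times, values, p))

# Synthetic intensity trace: one sample every 10 min for 1,000 min.
times = [10.0 * i for i in range(101)]
values = [50 + 10 * math.cos(2 * math.pi * t / 137.4 + 0.3) for t in times]
trial_periods = [float(p) for p in range(100, 251)]
best = dominant_period(times, values, trial_periods)
print(f"dominant period: {best:.0f} min")
```
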
Statistics. Regression lines were fitted in Excel (Microsoft). For graphical representation
only, lines were forced through zero. Goodness of fit (R²), probability of
linear correlation (F test) and similarity of regression coefficients (analysis of 
covariance; ANCOVA) were calculated with Prism 6 (GraphPad software). 
Comparison of means of two samples (unpaired Student’s t-test) was performed 
with Excel (Microsoft).

Restriction of intestinal stem cell expansion and

A remarkable feature of regenerative processes is their ability to
halt proliferation once an organ’s structure has been restored. The 
Wnt signalling pathway is the major driving force for homeostatic 
self-renewal and regeneration in the mammalian intestine. However, 
the mechanisms that counterbalance Wnt-driven proliferation are 
poorly understood. Here we demonstrate in mice and humans 
that yes-associated protein 1 (YAP; also known as YAP1), a
protein known for its powerful growth-inducing and oncogenic
properties, has an unexpected growth-suppressive function,
restricting Wnt signals during intestinal regeneration. Transgenic 
expression of YAP reduces Wnt target gene expression and results 
in the rapid loss of intestinal crypts. In addition, loss of YAP results 
in Wnt hypersensitivity during regeneration, leading to hyperpla- 
sia, expansion of intestinal stem cells and niche cells, and formation 
of ectopic crypts and microadenomas. We find that cytoplasmic 
YAP restricts elevated Wnt signalling independently of the AXIN-APC-GSK-3β
complex partly by limiting the activity of dishevelled
(DVL). DVL signals in the nucleus of intestinal stem cells, and its 
forced expression leads to enhanced Wnt signalling in crypts. YAP 
dampens Wnt signals by restricting DVL nuclear translocation 
during regenerative growth. Finally, we provide evidence that 
YAP is silenced in a subset of highly aggressive and undifferentiated 
human colorectal carcinomas, and that its expression can restrict 
the growth of colorectal carcinoma xenografts. Collectively, our 
work describes a novel mechanistic paradigm for how proliferative 
signals are counterbalanced in regenerating tissues. Additionally, 
our findings have important implications for the targeting of YAP 
in human malignancies. 

YAP is a critical component of the Hippo signalling pathway, which 
controls organ size in mammals. Through a kinase cascade, the
pathway targets YAP for phosphorylation, preventing its translocation
to the nucleus, where it functions as a transcriptional co-activator.
Current dogma suggests that restriction of the transcriptional activity
of YAP is the principal mechanism of growth and tumour suppression
by the Hippo pathway. Indeed, nuclear YAP is a powerful driver of
organ growth, progenitor proliferation and tumour growth. We
previously assessed YAP function in the mammalian intestine by using
a mouse model that resulted in ubiquitous postnatal expression of an
inducible YAP(S127A) mutant. This mutant protein is thought to
have enhanced nuclear localization given that it escapes inactivation
by the Hippo kinases LATS1 and LATS2 (ref. 3). As YAP might activate
paracrine signals, we sought to bypass non-cell-autonomous
effects by specifically expressing YAP in the intestinal epithelium using
the Villin-rtTA driver. YAP protein in the intestine of transgenic mice
was not restricted to the nucleus, suggesting that S127 is not the major
determinant of YAP subcellular localization in this tissue (Supplementary
Fig. 1a). Five-to-seven days after doxycycline (Dox) administration,


transgenic mice became moribund and were killed. Surprisingly, histo- 
logical evaluation of the small intestine and colon of transgenic mice 
revealed a progressive degenerative phenotype associated with the rapid 
loss of proliferating crypts (Fig. 1a and Supplementary Fig. 1b, c). 
Crypt-loss phenotypes are typically associated with reduced Wnt 
signalling. Indeed, degeneration was accompanied by repression of
the Wnt target gene Cd44 and loss of cells with nuclear β-catenin
(Fig. 1b and Supplementary Fig. 1e-g). Paneth cells are a mature
intestinal lineage that require high levels of Wnt signalling for their
proper differentiation and localization, and function as a critical component
of the intestinal stem cell (ISC) niche. In YAP transgenic



Figure 1 | YAP overabundance inhibits Wnt-mediated intestinal 
regeneration. a, Haematoxylin and eosin (H&E) staining of Dox-induced 
YAP(S127A) small intestine at 2, 4 and 7 days. Inset shows Ki67 stain 
representative of crypt proliferation. b, c, Wnt pathway activity and ISC 
presence at 2, 4 and 7 days after Dox induction represented by CD44 (b) and 
Olfm4 (in situ) (c). d, Heatmap of crypts isolated from control (n = 3) and day 2 
Dox-treated (n = 3) mice showing 540 downregulated genes in rank order. 
Labelled genes are examples known to be involved in Wnt signalling and the 
ISC niche. Original magnifications in each panel are ×20 (a, b) and ×10 (c).

mice, Paneth cells become mislocalized and eventually disappear
(Supplementary Fig. 1d). To determine if YAP expression was reducing
ISC numbers, we performed in situ hybridization (ISH) for Olfm4,
which marks crypt base columnar (CBC) stem cells. Olfm4+ CBCs
were markedly reduced 2 days after induction and were essentially
absent by day 4 (Fig. 1c). Consistent with these data, microarray analysis
on isolated crypts showed that many robustly downregulated
transcripts were known ISC signature genes (that is, Olfm4, Ascl2,
Smoc2 and Lgr5) (Fig. 1d and Supplementary Fig. 2a, b). Gene set
enrichment analysis (GSEA) demonstrated significant downregulation
of both intestinal β-catenin targets and a recently described ISC
gene signature (Supplementary Fig. 1c and Supplementary Table 1).
Inhibition of growth, loss of Paneth cells and suppression of Wnt/ISC
signature genes were confirmed in organoid cultures derived from
transgenic mice (Supplementary Fig. 2d, e). Together, these results
demonstrate that, in contrast to other tissues where it promotes growth,

Figure 2 | Loss of YAP leads to hyperactive Wnt signalling and expansion of
the stem cell niche after injury or stimulation with Rspo1. a, H&E, CD44,
SOX9 and lysozyme (Lys) staining of small intestinal crypts in control and cKO 
mice 1 week after irradiation. b, H&E staining of inert-virus-treated control and 
cKO small intestine. c, d, H&E staining (c) and CD44 immunohistochemistry 
(d) in control and cKO mice 1 week after administration of adenovirus
expressing Rspo1. e, GSEA of the ISC gene signature and intestine-specific
β-catenin target gene sets. Black bars represent individual genes in rank order.
ES, enrichment score; NES, normalized enrichment score. f, Quantitative

epithelial-specific expression of YAP suppresses intestinal renewal,
which occurs through inhibition of the Wnt signalling pathway. 
These observations prompted us to evaluate more carefully the 
phenotype of gut-specific YAP mutant mice. Consistent with a previous
report, developmental or acute YAP loss produced no major abnormalities
during normal intestinal homeostasis (Fig. 2b and Supplementary
Fig. 3a). However, after injury by whole-body irradiation (9 and
11 Gy), Villin-Cre Yapfl/fl (conditional knockout (cKO)) mice showed a
phenotype of crypt hyperplasia and overgrowth throughout the small
intestine and colon (Fig. 2a and Supplementary Fig. 3c, e). This observation
contrasts with the impaired repair observed in a dextran
sulphate sodium salt (DSS)-mediated colitis model (Supplementary
Fig. 3b). cKO crypts were hyperproliferative and showed upregulation 
of the Wnt target genes Cd44 and Sox9 as well as mislocalized and 
increased numbers of Paneth cells (Fig. 2a and Supplementary Fig. 3f). 
Apoptosis was not altered in cKO mice (Supplementary Fig. 3d).

polymerase chain reaction (qPCR) validating several upregulated Wnt/ISC
markers. g, h, Immunohistochemistry on control and cKO small intestine for
EGFP (expressed from the Lgr5 locus, inset is an untreated mouse) (g) and
lysozyme marking Paneth cells (h). i, Ectopic crypt formation in cKO mice
treated with Rspo1 stained for H&E or the Wnt target CD44. Original
magnifications in each panel are ×20 (a), ×10 (b-d), ×20 (g) and ×10
(h, i). Graphed data represent the mean and standard error of mean of three
individual mice per genotype.

Considering that intestinal regeneration after irradiation is characterized
by a state of Wnt hyperactivity, our data suggest a role for
YAP in restricting elevated Wnt signalling in vivo. 

To test directly if loss of YAP results in Wnt hypersensitivity, we 
injected adenovirus expressing R-spondin1 (Ad-Rspo1; R-spondin1
also known as Rspo1) into cKO and control mice. Rspo1 is a potent
secreted Wnt agonist that induces crypt-cell proliferation. Seven
days after Ad-Rspo1 injection, 80% of cKO mice (n = 8) became moribund
and were killed, whereas Ad-Fc-injected control mice (n = 8)
appeared normal. The intestines and colon of cKO mice showed a
massively hyperplastic phenotype, so that by day 7 most mature intestinal
lineages in cKO epithelia were replaced by highly proliferative
crypt-like tissue (Fig. 2b, c and Supplementary Fig. 4a-i). Hyperplasia 
was accompanied by upregulation of Wnt targets CD44, SOX9 and 
EPHB3, in addition to global upregulation of the intestinal β-catenin
target signature (Fig. 2d-f and Supplementary Fig. 4j). To determine if
YAP loss resulted in expansion of cells with the features of ISCs, we 
generated cKO:Lgr5-EGFP mice. LGR5 is normally expressed in the 
CBCs at the very bottom of the crypt (Fig. 2g, inset); however, after 
Rspo1 administration in control mice, the population of Lgr5+ ISCs
expands (Fig. 2g). This expansion is much more marked in the cKO
intestine, where the Lgr5+ domain is 3-4 times larger. ISC expansion
was confirmed by ISH for Olfm4 (Supplementary Fig. 5a), and by 
GSEA (Fig. 2e and Supplementary Table 1). cKO mice also showed 
a prominent increase in Paneth cell numbers (Fig. 2h and 
Supplementary Fig. 5b, c, d). Paneth cells and LGR5+ cells were evident
high in the crypt/villus axis (Fig. 2g and Supplementary Fig. 5c, d). We 
also observed ectopic epithelial foci resembling crypts in cKO mice, 
which had formed within villi and stained positive for CD44, lysozyme 
and proliferative markers (Fig. 2i and Supplementary Fig. 5e-g). Some 
of these progressed into structures reminiscent of microadenomas, 
which are pre-cancerous lesions of outpocketing pouches of crypt-like 
tissue that grow into normal villi (Supplementary Fig. 5g). These were 
never observed in control Rspo1-treated animals. Together, these
results demonstrate that YAP restricts the expansion of ISCs as well 
as critical components of the stem cell niche. 

To gain insight into the mechanism behind our observations, we 
revisited the expression pattern of YAP in the intestine. It has previously
been shown that YAP is primarily localized to the crypt base,
and is absent from villi. Using a new polyclonal antibody, we find
that YAP is enriched in the nucleus of CBCs and lower crypt cells
(Supplementary Fig. 6a-c) but is also expressed in upper transit
amplifying (TA) cells and throughout the villi, where it is predominantly
cytoplasmic (Supplementary Fig. 6a). This was confirmed by
cellular fractionation experiments (Supplementary Fig. 6d). YAP 
localization correlated with the staining pattern of CD44, demonstrating
that nuclear YAP is present in zones of active Wnt signalling,
but is mostly cytoplasmic where Wnt is restricted (Supplementary
Fig. 6e). These data indicate that cytoplasmic YAP might be functionally
responsible for terminating Wnt signalling and allowing progress
from a proliferative progenitor/stem cell compartment to a
post-mitotic, differentiated fate. To rule out the possibility that a transcriptional
function of YAP was repressing Wnt, we generated
YapS79A Villin-Cre mice. The YAP protein in these mutants cannot
bind to TEAD transcription factors, the main transcriptional effectors
of YAP. After Rspo1 injection, we observed no enhanced Wnt response
in YAP(S79A) mutant mice (Supplementary Fig. 7a). Supporting
a role for cytoplasmic YAP in restricting Wnt signalling, expression of 
YAP wild-type and a YAP(S127D) phospho-mimic limited Wnt 
reporter responsiveness in 293T cells (Supplementary Fig. 7b, c). 

The phenotype observed in cKO mice treated with Rspo1 histologically
resembled acute Apc deletion (Supplementary Fig. 8).
Surprisingly, we observed no obvious changes in β-catenin protein
levels in hyperplastic cKO crypts (Supplementary Fig. 8a-c). As
increases in β-catenin protein are the direct consequence of disrupting
the AXIN-APC-GSK-3β complex, these data suggest that
YAP restriction of Wnt signalling is probably not mediated by modulating
the activity of this complex. It has been recently suggested that
phosphorylated cytoplasmic YAP sequesters β-catenin in the cytoplasm
in cell lines. Although we observed a subtle expansion in the number
of nuclear β-catenin-positive cells in cKO mucosa (Supplementary
Fig. 8b), these probably represent the expansion of Paneth cells and 
we infer that this does not represent the major mechanism of YAP- 
mediated Wnt repression. In vitro, we observed increased Wnt activity 
after combined YAP and APC depletion versus knockdown of APC 
only, and a synergistic effect of YAP depletion together with GSK-3β
small-molecule inhibition (Supplementary Fig. 8d, e). In mice, combined
deletion of APC and YAP (double knockout) versus APC only in
an acute or a long-term adenoma model resulted in a robust upregulation
of CD44 in addition to significant increases in the number of
Paneth cells and proliferative cells (Supplementary Fig. 9a-g). Thus,
our data suggest that YAP acts to restrict Wnt signalling independently
of, and in some cases synergistically with, the activity of the destruction
complex and control of subcellular localization of β-catenin.
Recently, it was reported that TAZ, a protein that bears a 57% 
homology to YAP, interacts with DVL, a positive Wnt signalling
regulator. Work primarily done in Drosophila has shown that DVL
acts in the cytoplasm upstream of the β-catenin destruction complex.
However, emerging work in mammals suggests that DVL also exists in 
the nucleus, mediating the Wnt transcriptional response partly in 
conjunction with c-Jun and TCF4 (refs 24, 25). We determined that 
YAP also interacts with DVL2 (Supplementary Fig. 10a). The function 
of DVLs in intestinal regeneration has been poorly characterized, 
although the observation that loss of DVL2 causes a decrease in 
intestine length and crypt diameter suggests an important role in

Figure 3 | YAP restricts Wnt signalling by blocking DVL nuclear
translocation. a, b, DVL2 immunohistochemistry in control and cKO small
intestine treated with Ad-Fc (a) and Ad-Rspo1-Fc (b). Insets are higher
magnifications showing subcellular localization of DVL2. 

c, Immunofluorescent staining for DVL2 in confluent DLD1 cells after 
transfection with indicated short interfering RNA (siRNA). siSCR, scrambled 
siRNA. d, Expression analysis of Wnt target genes in DLD1 cells after 
transfection with siRNAs against YAP, DVL2 plus DVL3 (D2/3) or YAP plus 
DVL2/3. e, Expression analysis of organoids infected with lentivirus encoding 
Dox-inducible DVL-NLS alone, or on a Villin-rtTA TetO-YAP(S127A) 
background. Dox was given in culture medium for 4 days. All graphed data 
represent the mean and standard deviation of triplicate cultures. Original 
magnifications in each panel are ×10 (a, b), ×20 (c) and ×4 (e).
intestinal biology”®. We find that DVL2 expression is restricted to the
crypt compartment, where it is localized in the nucleus of CBCs and 
assumes a more diffuse pattern in the TA compartment (Fig. 3a). 
Furthermore, DVL2 nuclear localization is enhanced in crypts after 
Rspo1 administration and during regeneration following irradiation
(Fig. 3b and Supplementary Fig. 10b). To evaluate the role of nuclear 
DVL in crypts, we infected intestinal organoids with lentiviruses 
expressing a Dox-inducible DVL carrying a nuclear localization signal 
(DVL-NLS)”. DVL-NLS increases Wnt target gene expression and 
leads to the transient formation of spheroid organoids resembling 
those with constitutive Wnt signalling (Fig. 3e)°. In addition, express- 
ion of DVL-NLS after APC knockdown results in a synergistic activa- 
tion of the TOPflash Wnt reporter (Supplementary Fig. 10c). Thus, our 
findings are consistent with an important nuclear function of DVL in 
intestinal progenitors and Wnt signalling in parallel to or downstream 
of the destruction complex. 

Given the observed physical interaction between YAP and DVL2 and 
their overlapping patterns of subcellular localization in crypts, we pro- 
posed that YAP might control DVL2 subcellular localization and/or 
activity. We observed a massive expansion of cells positive for nuclear 
DVL2 in cKO mice treated with Rspo1 and during radiation-induced
regeneration (Fig. 3b and Supplementary Fig. 10b). Additionally, YAP 
knockdown led to DVL nuclear accumulation in DLD1 cells (Fig. 3c and 
Supplementary Fig. 10d, e). We found no increase in levels of nuclear 
β-catenin in this context (Supplementary Fig. 10e). Loss of DVL2 and
DVL3 partially or completely rescued increased Wnt target gene 
expression in the absence of YAP (Fig. 3d). To determine if YAP could 
block DVL-induced Wnt transcriptional responses, we generated orga- 
noids expressing Dox-inducible DVL-NLS (Supplementary Fig. 10f) 
and YAP. Expression of YAP completely abrogated the DVL-mediated 
upregulation of the Wnt target genes Lgr5 and Axin2 (Fig. 3e).
Therefore, we conclude that YAP is critical for the proper subcellular 
localization of DVLs and blocks their ability to induce Wnt signalling. 

Finally, we investigated the role of YAP in human colorectal car- 
cinoma. Using DLD1 cells infected with Dox-inducible YAP wild type 
or YAP(S127D), we performed xenograft assays to assess the effect of 
Figure 4 | YAP function in human colorectal cancer. a, Xenograft tumour
formation of DLD1 cells infected with lentivirus encoding Dox-inducible YAP 
wild type (WT) or YAP(S127D) mutant. Tumour photographs are 
representative images. Ruler is in centimetres. b, Heatmap representing the top 
328 genes downregulated by YAP(S127D) induction in xenograft assays (n = 3 
tumours per treatment group). Labelled genes are known β-catenin targets in
human colorectal carcinoma. c, GSEA of TCF4-dependent colorectal 
carcinoma (CRC) target genes. d, Overall survival analysis of patients with 
YAP on tumour growth. We found a marked decrease in tumour
growth with the addition of Dox, particularly with YAP(S127D)
(Fig. 4a and Supplementary Fig. 11a, b). This decrease in tumour
size coincided with global suppression of the colorectal carcinoma
TCF4/β-catenin’”” and ISC'' gene signatures (Fig. 4b, c, Supplemen-
tary Fig. 11c, d and Supplementary Table 1). We next examined if
YAP loss was associated with human colorectal carcinoma by evalu- 
ating YAP expression in a cohort of 672 colorectal carcinoma samples 
using immunohistochemistry**. Complete loss of YAP staining occurred 
within a minor subset of human tumours (10.5%), but predicted 
worse patient survival and was associated with high-grade, stage 
IV disease, compared with YAP-positive groups (Supplementary 
Fig. 11e-h and Fig. 4d-f). These results are consistent with our animal
data and demonstrate that YAP may act as a tumour suppressor in 
human colorectal carcinoma. 

Precise mechanisms must exist to dampen proliferative potential to 
maintain organ size. We have identified YAP and DVLs as key players 
in this process by restricting the expansion of ISCs and their niche 
during Wnt-driven regeneration. Although we have not ruled out a 
pro-proliferative role for nuclear YAP, our results suggest that the 
predominant function of YAP in the intestine is growth restrictive. 
During normal development and homeostasis, TAZ (whose expression 
is highly enriched in crypts; Supplementary Fig. 6d) might be com- 
pensating for the loss of YAP. Alternatively, YAP together with DVL 
might be part of a regeneration-exclusive molecular machinery that 
helps counteract hyperactive Wnt signalling, which occurs during emer- 
gency growth. Interestingly, YAP is a WNT target gene”, suggesting 
that it might participate in a negative feedback loop to limit WNT- 
initiated signals. At present, therapeutic efforts are aimed at dimi- 
nishing YAP protein levels in many malignancies, including colorectal 
carcinoma”’. Our results suggest that such manipulations could poten- 
tially result in increased tumour growth. YAP-targeting therapies 
should aim to disrupt the YAP-TEAD interaction or to induce the 
cytoplasmic translocation of YAP. Further study of the role of YAP as a
growth suppressor is likely to provide important insights into the 
biology of regeneration, size control and tumour progression. 
specific YAP staining patterns. YAP immunohistochemistry was placed into
four groups of staining patterns correlating with subcellular localization from
672 colorectal cancer patients: cytoplasmic and nuclear (C (+) N (+));
cytoplasmic (C (+) N (−)); nuclear positive (C (−) N (+)); and complete loss
of staining (C (−) N (−)). e, YAP staining in low- and high-grade tumours.
Original magnification, ×20. f, YAP loss is significantly associated with high-
grade tumours and stage IV disease (CI, confidence interval; OR, odds ratio).
METHODS SUMMARY

Immunohistochemistry. Staining was performed with the following antibodies 
using the Vectastain ABC-Elite kit according to the manufacturer’s instructions, 
except for Cryptdin1, which was detected using an anti-goat Alexa-Fluor 488 
(Invitrogen). Rabbit anti-YAP (Cell Signaling 1:40, Avruch Lab 1:400), mouse 
anti-PCNA (Santa Cruz Biotechnology), rat anti-CD44 (BD Biosciences 1:100), 
rat anti-Ki67 (Dako 1:40), rabbit anti-phospho-histone H3 (Millipore 1:300), 
rabbit anti-SOX9 (Millipore 1:300), goat anti-EPHB3 (R&D 1:300), rabbit anti-
GFP (Abcam 1:3,000), goat anti-Cryptdin1 (A. Ouellette), rabbit anti-DVL2 (Cell
Signaling 1:100). 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS
Mice. Yap conditional knockout and S79A point mutant mice were previously 
described’. Yap cKO mice were intercrossed with Villin-Cre°', Villin-CreER”, 
Lgr5-EGFP-IRES-CreER® and Apc" mice*. Mice were given a single dose of ta- 
moxifen (1 mg in corn oil injected intraperitoneally) in Yap/Apc conditional knock- 
out experiments with Villin-CreER. Villin-rtTA mice were described previously’. 
For in vivo administration of R-Spondin1, adenovirus expressing Fc-tagged 
R-Spondin1 of mouse origin was retro-orbitally injected at 3 X 10° plaque-forming 
units (p.f.u.). Virus was produced as previously described'*. Tissue was collected at 
3.5 or 7 days. For irradiation experiments, mice were exposed to 9 or 11 Gy at a dose
of 0.664 Gy min⁻¹ from a caesium-137 irradiator. Tissue was collected 5 days after
irradiation. 
Immunohistochemistry of mouse tissue. Staining was performed with the fol- 
lowing antibodies using the Vectastain ABC-Elite kit according to the manufac- 
turer’s instructions, with the exception of Cryptdin1, which was detected using an 
anti-goat Alexa-Fluor 488 (Invitrogen). Rabbit anti-YAP (Cell Signaling 1:40, 
Avruch Lab 1:400), mouse anti-PCNA (Santa Cruz Biotechnology), rat anti-CD44 
(BD Biosciences 1:100), rat anti-Ki67 (Dako 1:40), rabbit anti-phospho-histone H3 
(Millipore 1:300), rabbit anti-SOX9 (Millipore 1:300), goat anti-EPHB3 (R&D 
1:300), rabbit anti-GFP (Abcam 1:3,000), goat anti-Cryptdin1 (Andre Ouellette), 
rabbit anti-DVL2 (Cell Signaling 1:100). Antigen retrieval was performed using low 
pH citrate buffer. Staining for β-catenin was performed as previously described”.
Lysozyme (Dako rabbit anti-human lysozyme) staining was performed as follows. 
Antigen retrieval was performed by proteinase-K digestion at 37°C for exactly 
12 min in Tris-EDTA pH 9 followed by PBS wash for 5 min and quenching of
endogenous peroxidases and Vectastain ABC-Elite kit processing. Next, tissue was 
incubated in primary antibody at 1:1,000 in PBS 0.1% Triton at 4 °C overnight. 
ISH. ISH for Olfm4 transcripts was performed on formalin-fixed intestine at the 
specialized histopathology service core at the Dana Farber Cancer Institute using 
RNAscope, according to the manufacturer’s instructions. 
Quantification of phosphorylated H3S10 and lysozyme foci. For Villin-CreER 
experiments, lysozyme quantification was performed on a cohort of five control 
(Apc) and five double-cKO (Yap"" Apc™") mice. The quantification was per-
formed blind to specimen identity. Paneth cell numbers were quantified by the
number of crypt cells staining positive for lysozyme on a per crypt basis. For each 
sample, per crypt numbers of lysozyme-positive cells were averaged over at least 
15 ×10 fields of view per mouse. In Villin-CreSt experiments, phosphorylated
H3S10 and lysozyme positivity was counted in a similar manner. Student’s t-test
was used to determine statistical significance with a cut-off of P < 0.05.
DSS treatment. Mice were given 2.5% DSS in drinking water for 7 days, followed 
by normal drinking water for 3 days. Colon tissue was snap frozen and cryosec- 
tions were stained with haematoxylin to visualize tissue architecture. 
siRNA-mediated knockdown. Cells were reverse transfected with a 7 nM final 
concentration of the indicated siRNA (Ambion Silencer Select) using Lipofec- 
tamine RNAiMax (Invitrogen), according to the manufacturer’s instructions. 
For expression analysis in DLD1 cells, RNA was extracted 4 days after transfection. 
Oligonucleotides against human transcripts used were (5'-3'):
siAPC sense, GGAUCUGUAUCAAGCCGUUTT; antisense, AACGGCUUGAUACAGAUCCTT
(oligonucleotide no. s1433);
siYAP sense, GGUGAUACUAUCAACCAAATT; antisense, UUUGGUUGAUAGUAUCACCTG
(oligonucleotide no. 20366);
siDVL3 sense, GAUAUGUUGUUACAGGUAATT; antisense, UUACCUGUAACAACAUAUCTC
(oligonucleotide no. s675);
siDVL2 (pooled two siRNAs) sense, CACCAUCCCUAAUGCCUUUTT; antisense,
AAAGGCAUUAGGGAUGGUGAT (oligonucleotide no. s4396); and sense,
CAGUCACGCUAAACAUGGATT; antisense, UCCAUGUUUAGCGUGACUGTG
(oligonucleotide no. s4398).
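As a quick consistency check on duplexes like those listed above, the antisense strand (minus its 3' overhang) should be the reverse complement of the sense strand (minus its 3' overhang). The sketch below is illustrative only: the helper names are hypothetical, and it assumes a two-nucleotide 3' overhang on each strand, which matches the listed sequences but is not stated explicitly in the Methods.

```python
# Hypothetical helpers for checking siRNA duplex strands; not part of the published Methods.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(seq: str) -> str:
    """Return the RNA reverse complement of a sequence written 5'-3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def is_matched_duplex(sense: str, antisense: str, overhang: int = 2) -> bool:
    """True if the strands pair perfectly once the assumed 3' overhangs are removed."""
    sense_core = sense[:-overhang]
    antisense_core = antisense[:-overhang]
    return reverse_complement_rna(sense_core) == antisense_core


# siAPC pair as listed in the text (the trailing TT is a DNA overhang).
print(is_matched_duplex("GGAUCUGUAUCAAGCCGUUTT", "AACGGCUUGAUACAGAUCCTT"))
```

A check of this kind is a convenient way to catch transcription errors when oligonucleotide sequences are copied between documents.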
RNA extraction, cDNA synthesis and qPCR. RNA was extracted from cells or 
tissue using an RNeasy mini kit according to the manufacturer’s instructions 
(Qiagen). cDNA synthesis was performed on 1 µg of total RNA using iScript
cDNA synthesis kit (Biorad). qPCR was performed using Taqman probes available
from Applied Biosystems and Taqman Fast-Advanced master mix. Probes used
were as follows: mouse Cd44 Mm01277163_m1; Lgr5 Mm00438890_m1; Myc
Mm004877804; Wwtr1 (Taz) Mm00513560_m1; Nkd1 Mm00471902_m1; Yap
Mm00494240_m1; Sox9 Mm00448840_m1; Cd133 (Prom1) Mm00477115_m1;
Axin2 Mm00443610_m1; Apc Mm00545877_m1; and Fabp2 Mm00433188_m1.
Human probes were: NKD1 Hs00263894_m1 and FOXN1 Hs00263894_m1. All
reactions were run on an Applied Biosystems Step-One-Plus real-time PCR instru-
ment and data were calculated using the ΔΔCt method. Significance values were
calculated using Student’s t-test with a P-value cut-off of less than 0.05. 
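The relative-quantification step can be sketched as follows. This is a minimal illustration of the standard comparative Ct calculation, not the authors' analysis script: the function name is hypothetical, the reference (housekeeping) gene is not named in the Methods, and the calculation assumes roughly 100% amplification efficiency for both target and reference assays.

```python
def fold_change_ddct(ct_target_treated: float,
                     ct_ref_treated: float,
                     ct_target_control: float,
                     ct_ref_control: float) -> float:
    """Relative expression (treated vs control) by the comparative Ct method.

    Assumes the amount of product doubles each cycle for both assays.
    """
    # Normalise the target gene to the reference gene within each sample.
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    # Compare the normalised values between conditions.
    ddct = delta_treated - delta_control
    return 2.0 ** (-ddct)


# Illustrative Ct values only (not data from the paper).
print(fold_change_ddct(24.0, 18.0, 26.0, 18.0))
```

With these illustrative inputs the target crosses threshold two cycles earlier in the treated sample after normalisation, which corresponds to a fourfold increase.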
Immunofluorescence. For immunocytochemistry of adherent DLD1 cells, cul- 
tures were grown on sterile coverslips and washed once with PBS followed by 
fixation in 4% paraformaldehyde/PBS for 15 min at room temperature (~20 °C). 
Cells were then washed twice with cold PBS and permeabilized with PBS/0.25% 
Triton X-100 for 10 min at room temperature. Cells were then washed three times 
in PBS followed by blocking in 1% BSA/PBS for 30 min at room temperature. Next,
cells were incubated in primary antibodies in 1% BSA/PBS overnight at 4°C 
followed by 3 washes in PBS. Cells were then incubated for 1 h in a dark humidi- 
fied chamber with Alexa-Fluor-conjugated secondary antibodies in 1% BSA/PBS 
followed by 3 washes in PBS. Coverslips were then stained with 4′,6-diamidino-2-
phenylindole (DAPI) and mounted on glass slides. Staining was performed with
the following antibodies: mouse anti-YAP (Santa Cruz 1:500) and rabbit anti- 
DVL2 (Millipore 1:200). 

Co-immunoprecipitation. 293T cells were transiently transfected with GFP (ne-
gative control) or indicated Flag-tagged versions of YAP2 (plasmids purchased
from Addgene, originally constructed in the laboratory of M. Sudol) or Myc- 
tagged DVL2 (from the X. He lab) using Lipofectamine 2000. After 48h, cells 
were lysed in buffer containing 1% Triton X-100, 50 mM Tris HCl pH 7.4, 150 mM 
NaCl and 1 mM EDTA followed by passage through a 26-gauge needle ten times. 
Lysates were pre-cleared with protein-A agarose and then incubated in anti-Flag 
resin (Sigma Aldrich) for 1 h 30 min while shaking. Protein complexes were then 
washed four times in lysis buffer and boiled for western detection. Mouse anti-Flag 
M2 antibody (Sigma Aldrich) was used at 1:1,000 to detect Flag-tagged YAP and 
rabbit anti-Myc antibody (Sigma Aldrich 1:1,000) was used to detect Myc-tagged 
DVL2.

Subcellular fractionation. DLD1 cells were reverse transfected with the indicated 
siRNA. After 4 days, cells were washed with PBS, scraped and centrifuged. Cells 
were then resuspended in 3-4 pellet volumes of fractionation buffer containing 
sucrose, HEPES, KCl, MgCl, EDTA, NP-40, protease and phosphatase inhibitors. 
Cells were then passed through a 25.5-gauge needle ten times and left on ice for 
20 min and then centrifuged to pellet nuclei away from cytoplasm. Nuclei were 
then washed by passage through a 25.5-gauge needle ten times and centrifuged. 
Nuclear and cytoplasmic fractions were then resuspended in equal volumes of lysis 
buffer and analysed by western blots. The following antibodies were used: mouse 
anti-YAP (Santa Cruz 1:1,000), rabbit anti-DVL2 (Cell Signaling 1:1,000), anti-
fibrillarin (Abcam 1:1,000), anti-β-tubulin (Cell Signaling 1:1,000).

Wnt reporter assays. 293T cells were first transfected with the indicated siRNA. 
Two days later, TOPflash and renilla plasmids were cotransfected. Wnt activity 
was assayed 48 h after reporter transfection. All values are represented as the ratio 
of firefly to renilla. Transfections were performed in triplicate. 
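The normalisation described above can be sketched as follows: each well's firefly (TOPflash) signal is divided by its renilla signal, and the triplicate ratios are then summarised. The function name and the example readings are hypothetical, and summarising by mean and sample standard deviation is an assumption carried over from the figure legends rather than something stated in this paragraph.

```python
from statistics import mean, stdev


def normalized_wnt_activity(firefly: list[float], renilla: list[float]) -> tuple[float, float]:
    """Return (mean, sample s.d.) of per-well firefly/renilla ratios."""
    if len(firefly) != len(renilla):
        raise ValueError("each firefly reading needs a matching renilla reading")
    if len(firefly) < 2:
        raise ValueError("at least two replicates are needed for a standard deviation")
    # Normalise well by well so that transfection efficiency cancels out.
    ratios = [f / r for f, r in zip(firefly, renilla)]
    return mean(ratios), stdev(ratios)


# Illustrative luminescence readings for one triplicate (not data from the paper).
activity, spread = normalized_wnt_activity([200.0, 400.0, 600.0], [100.0, 100.0, 100.0])
print(activity, spread)
```

Dividing within each well before averaging keeps well-to-well differences in transfection efficiency from inflating the apparent variability.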

A stable 293T cell line was used in BIO and Wnt3A plus Rspo1 induction
experiments. This line was created by co-transfecting the TOPflash plasmid with 
a renilla-expressing plasmid (10:1 ratio of firefly to renilla) carrying a blasticidin 
resistance selection marker. Cells were selected under 6 µg ml⁻¹. Clones were
picked, expanded and screened for TOPflash activity after treatment with recom-
binant Wnt3A. YAP and scrambled siRNAs were transfected at 7 nM, and induced
with Wnt3A and Wnt3A (500 ng ml⁻¹) plus RSpondin1 (500 ng ml⁻¹) 3 days later
when cells were approximately 50% confluent, for 16 h. Induction with bromoin-
dirubin (5 µM) (BIO; Sigma Aldrich) was performed in the same manner as with
Wnt3A and Rspo1. Controls were treated with DMSO. To test the effect of nuclear
localized DVL in 293T WNT reporter assays, a DVL-NLS-expressing construct” 
was transiently transfected with renilla and TOPflash plasmids 3 days after siRNA 
treatment. Luciferase levels were measured 48 h later. 

Microarray analysis. In transgenic experiments, whole RNA was extracted from 
mouse crypts. RNA was extracted from whole epithelium in Rspo1 experiments.
Microarray experiments were performed by the Microarray Core of the Molecular 
Genetics Core Facility at the Children’s Hospital Boston. Material was processed 
for Affymetrix Mouse GeneChip 1.0ST arrays. For each experimental condition, 
three mice were used. CEL files were converted to GCT files using Expression- 
FileCreator in Gene Pattern, publicly available from the Broad Institute (http:// 
www.genepattern.broadinstitute.org). GCT files were preprocessed and used in 
the Comparative Marker Selection module to obtain lists of differentially regulated 
genes. In transgenic experiments, a threshold of P = 0.05 and fold change of 1.4 or 
greater was used. In Rspo1 experiments, a fold change of 2 and q value (FDR-
corrected P value) of 0.05 were used as cut-offs. Expression analysis of human
xenograft tumours was performed using Affymetrix Human GeneChip 1.0ST 
chips and processed as above. GCT files were used in Comparative Marker 
Selection. RNA was used from three control (no Dox) and three YAP(S127D) 
Dox-induced tumours. Threshold cut-offs were P = 0.05 and a fold change of 1.4- 
fold or greater. GSEA was performed using GCT files from Yap experiments 
compared to gene sets cited in the main text. 
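The thresholding step for the transgenic and xenograft comparisons (P of 0.05 and a fold change of 1.4 or greater) can be illustrated with the sketch below. It is not the Comparative Marker Selection module itself: the data class, function names and gene labels are hypothetical, expression values are assumed to be positive and on a linear scale, and fold change is taken as a symmetric magnitude so that up- and downregulated genes are treated alike.

```python
from dataclasses import dataclass


@dataclass
class GeneResult:
    symbol: str          # placeholder gene label
    mean_control: float  # mean linear-scale expression, control group
    mean_treated: float  # mean linear-scale expression, treated group
    p_value: float       # P value reported by the upstream statistical test


def fold_magnitude(result: GeneResult) -> float:
    """Fold change as a magnitude of at least 1, regardless of direction."""
    return max(result.mean_treated / result.mean_control,
               result.mean_control / result.mean_treated)


def select_regulated(results: list[GeneResult],
                     p_cutoff: float = 0.05,
                     min_fold: float = 1.4) -> list[str]:
    """Return labels of genes passing both the P-value and fold-change cut-offs."""
    return [r.symbol for r in results
            if r.p_value <= p_cutoff and fold_magnitude(r) >= min_fold]


# Illustrative values only (not data from the paper).
example = [
    GeneResult("gene_a", 100.0, 150.0, 0.01),  # passes both cut-offs
    GeneResult("gene_b", 100.0, 120.0, 0.01),  # fold change too small
    GeneResult("gene_c", 100.0, 50.0, 0.20),   # P value too large
    GeneResult("gene_d", 150.0, 100.0, 0.04),  # downregulated, passes
]
print(select_regulated(example))
```

For the Rspo1 experiments the same filter would apply with a minimum fold change of 2 and the FDR-corrected q value supplied in place of the raw P value.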

Organoid culture and lentiviral infection. Organoids were derived, propagated 
and infected according to previously published work***’. pINDUCER20*° was 
used for inducible lentiviral expression studies in DLD1 xenografts and organoids. 
Human study population. The databases of two nationwide prospective cohort 
studies were used: the Nurses’ Health Study (N = 121,701 women followed since 
1976); and the Health Professionals Follow-up Study (N = 51,529 men followed 
since 1986)°*. We collected paraffin-embedded tissue blocks from hospitals 
throughout the United States where patients underwent colorectal cancer resec-
tions. Haematoxylin and eosin-stained tissue sections from all colorectal cancer
cases were reviewed by a pathologist (S.O.) unaware of other data. Tumour grade
was categorized as low versus high (>50% versus ≤50% gland formation). We
excluded cases that were preoperatively treated. Patients were observed until death 
or until January 1, 2011, whichever came first. Death of a participant was con- 
firmed by the National Death Index. Returning a questionnaire indicated 
informed consent from all study participants. This study was approved by the 
Human Subjects Committees at Harvard School of Public Health and Brigham 
and Women’s Hospital. 

Immunohistochemistry on human samples. Tissue microarrays were con- 
structed as previously described”. For YAP immunohistochemistry, deparaffinized 
tissue sections were treated with Antigen Retrieval Citra Solution (Biogenex 
Laboratories) in a microwave for 15 min. Tissue sections were then incubated with 
Peroxidase Blocking Reagent (15 min; DAKO) and primary antibody against YAP 
(rabbit polyclonal, 1:400 dilution; Cell Signaling) was applied, and slides were 
incubated for 16h at 4°C. Next, we applied SignalStain Boost IHC Detection 
Reagent (Cell Signaling) for 30 min followed by visualizing signal with diamino-
benzidine (5 min; DAKO) and haematoxylin counterstain. Each immunohisto-
chemical marker was evaluated by a pathologist (T.M.) unaware of other data.
Statistical analysis. All statistical analyses were performed with the SAS program 
(version 9.1, SAS Institute). All P values were two-sided and statistical significance
was set at P = 0.05. For categorical data, the chi-square test was performed.
The Kaplan-Meier method and log-rank test were used for survival analyses.


31. Madison, B. B. et al. cis elements of the villin gene control expression in restricted
domains of the vertical (crypt) and horizontal (duodenum, cecum) axes of the
intestine. J. Biol. Chem. 277, 33275-33283 (2002).

32. El Marjou, F. et al. Tissue-specific and inducible Cre-mediated recombination in
the gut epithelium. Genesis 39, 186-193 (2004).

33. Barker, N. et al. Identification of stem cells in small intestine and colon by marker
gene Lgr5. Nature 449, 1003-1007 (2007).

34. Colnot, S. et al. Colorectal cancers in a new mouse model of familial adenomatous
polyposis: influence of genetic and environmental modifiers. Lab. Invest. 84,
1619-1630 (2004).

35. Barker, N. et al. Crypt stem cells as the cells-of-origin of intestinal cancer. Nature
457, 608-611 (2009).

36. Koo, B. K. et al. Controlled gene expression in primary Lgr5 organoid cultures.
Nature Methods 9, 81-83 (2012).

37. Sato, T. et al. Single Lgr5 stem cells build crypt-villus structures in vitro without a
mesenchymal niche. Nature 459, 262-265 (2009).

38. Meerbrey, K. L. et al. The pINDUCER lentiviral toolkit for inducible RNA
interference in vitro and in vivo. Proc. Natl Acad. Sci. USA 108, 3665-3670
(2011).

39. Ogino, S. et al. Combined analysis of COX-2 and p53 expressions reveals
synergistic inverse correlations with microsatellite instability and CpG island
methylator phenotype in colorectal cancer. Neoplasia 8, 458-464 (2006).






doi:10.1038/nature11699 


Ca²⁺ regulates T-cell receptor activation by
modulating the charge property of lipids 


Xiaoshan Shi, Yunchen Bi, Wei Yang, Xingdong Guo, Yan Jiang, Chanjuan Wan, Lunyi Li, Yibing Bai, Jun Guo,
Yujuan Wang, Xiangjun Chen, Bo Wu, Hongbin Sun, Wanli Liu, Junfeng Wang & Chenqi Xu


Ionic protein-lipid interactions are critical for the structure and
function of membrane receptors, ion channels, integrins and many
other proteins’”. However, the regulatory mechanism of these
interactions is largely unknown. Here we show that Ca²⁺ can bind
directly to anionic phospholipids and thus modulate membrane
protein function. The activation of T-cell antigen receptor-CD3
complex (TCR), a key membrane receptor for adaptive immunity,
is regulated by ionic interactions between positively charged CD3ε/ζ
cytoplasmic domains (CD3cp) and negatively charged phospho-
lipids in the plasma membrane’*"°. Crucial tyrosines are buried in
the membrane and are largely protected from phosphorylation in
resting T cells. It is not clear how CD3cp dissociates from the mem-
brane in antigen-stimulated T cells. The antigen engagement of even
a single TCR triggers a Ca²⁺ influx"! and TCR-proximal Ca²⁺ con-
centration is higher than the average cytosolic Ca²⁺ concentration”.
Our biochemical, live-cell fluorescence resonance energy transfer
and NMR experiments showed that an increase in Ca²⁺ concentra-
tion induced the dissociation of CD3cp from the membrane and
the solvent exposure of tyrosine residues. As a consequence, CD3
tyrosine phosphorylation was significantly enhanced by Ca²⁺ influx.
Moreover, when compared with wild-type cells, Ca²⁺ channel-
deficient T cells had substantially lower levels of CD3 phosphory-
lation after stimulation. The effect of Ca²⁺ on facilitating CD3
phosphorylation is primarily due to the charge of this ion, as demon-
strated by the fact that replacing Ca²⁺ with the non-physiological ion
Sr²⁺ resulted in the same feedback effect. Finally, ³¹P NMR spectro-
scopy showed that Ca²⁺ bound to the phosphate group in anionic
phospholipids at physiological concentrations, thus neutralizing the
negative charge of phospholipids. Rather than initiating CD3 phos-
phorylation, this regulatory pathway of Ca²⁺ has a positive feedback
effect on amplifying and sustaining CD3 phosphorylation and
should enhance T-cell sensitivity to foreign antigens. Our study thus
provides a new regulatory mechanism of Ca²⁺ to T-cell activation
involving direct lipid manipulation.

TCR is one of the most complicated cell surface receptors and is 
composed of a ligand-sensing TCRαβ subunit and the three signalling
subunits CD3εδ, εγ and ζζ. TCR recognition of the peptide-major
histocompatibility complex (pMHC) is an essential step in initiating 
the host adaptive immune response against invading pathogens”’. 
Antigen binding first triggers the tyrosine phosphorylation of the 
immunoreceptor tyrosine-based activation motifs (ITAMs) in CD3 
chains, which is primarily mediated by Lck"*, and consequently elicits 
a comprehensive signalling network hallmarked by Ca²⁺ influx through
the Ca²⁺-release-activated Ca²⁺ channel (CRAC)'*"®. The regulatory
mechanism of ITAM phosphorylation is a longstanding puzzle’. 
Another central question that remains is how T cells can be activated 
by minute quantities of foreign antigen'’’*. Recent work shows that 
ITAM phosphorylation is regulated by the ionic interaction between 


the positively charged ITAM and the negatively charged phospholipids 
in the plasma membrane’*"'*!*”. In resting T cells, the CD3ε and ζ
cytoplasmic domains bind to the plasma membrane, and their ITAM 
tyrosines are inserted into the hydrophobic core of the plasma mem- 
brane’*"°. The insertion prevents spontaneous ITAM phosphorylation 
Figure 1 | Ca²⁺ induced the dissociation of CD3εcp from the membrane
bilayer. a-f, The effect of Ca²⁺ on the CD3εcp-membrane interaction was
measured by an equilibrium-based microdialysis assay (a, b) and a live-cell
FRET imaging experiment (c-f). a, In the absence of Ca²⁺, CD3εcp bound
strongly to POPG large unilamellar vesicles, not to POPC large unilamellar
vesicles. The increase of Ca²⁺ concentration significantly impaired the
CD3εcp-POPG interaction. Triplicate measurements were made for each
condition and the results (mean ± s.d.) were plotted as the percentage of the
fluorescence intensity in the experimental (POPG) and control (POPC)
chambers. b, The results in a were converted to the binding efficiency of
CD3εcp to POPG and presented as mean ± s.d. c-f, The FRET efficiency was
measured between TFP1 and the membrane dye R18 by the dequenching
approach in live Jurkat T cells. Ionomycin (5 µM) was used to induce Ca²⁺
influx. The FRET efficiencies of the 3 aa-TFP1 and 50 aa-TFP1 control
constructs were not sensitive to Ca²⁺ influx, whereas the FRET efficiency of
CD3εcp-TFP1 was reduced significantly in response to Ca²⁺ influx. AP, after
photobleaching; BP, before photobleaching. The FRET efficiencies were
measured for more than 15 cells per condition. Each dot represents the FRET
value from one individual cell. Unpaired two-tailed Student’s t-test was used for
the statistical analysis.
in unstimulated T cells even when active Lck proteins are constitutively
available’. In antigen-stimulated T cells, ITAMs need to be dislodged
from the plasma membrane and become accessible for Lck. Immediately
after the initial TCR triggering, Ca²⁺ locally influxes into T cells”. Ca²⁺
Figure 2 | Ca²⁺ induced the solvent exposure of tyrosine residues in
CD3εcp ITAM. a, Superimposed ¹⁵N-¹H HSQC spectra of CD3εcp with
POPG bicelles (blue), CD3εcp with POPG bicelles and Ca²⁺ with different
concentrations (cyan, yellow and purple), and CD3εcp with control POPC
bicelles (red). Membrane-bound CD3εcp (blue) and solvent-exposed CD3εcp
(red) had very different amide resonance spectra. Ca²⁺ was titrated into the
CD3εcp-with-POPG sample at a molar ratio of [Ca²⁺]:[POPG] from 0.2 to 0.8.
In response to the increase in the Ca²⁺ concentration, the CD3εcp amide
resonances exhibited systematic shifts from the membrane-bound state to the
solvent-exposed state. The resonance changes of two tyrosine residues, Tyr 38
and Tyr 49, are enlarged in the inset. HSQC experiments were performed with
0.05 mM ¹⁵N-labelled CD3εcp and 7.5 mM POPG or POPC bicelles (q = 0.8;
q is the molar ratio of long-chain lipids (POPG or POPC) to short-chain lipids
(DHPC)) in 20 mM Tris-HCl buffer, pH 7.0. b, Strips from aromatic NOESY
spectra showing NOEs (distance of <5 Å) between the aromatic protons Hε/
Hδ of Y38 and Y49 and the methylene protons in the lipid acyl chains. The
substantial NOE signals (marked by asterisk) observed in the absence of Ca²⁺
indicated the insertion of tyrosine side chains into the membrane hydrophobic
interior. The addition of Ca²⁺ resulted in the significant impairment of NOE
signals, which indicated that tyrosine side chains were no longer embedded in
the membrane core region. NOESY experiments were performed with 0.05 mM
¹⁵N-, ¹³C-labelled CD3εcp, 7.5 mM POPG bicelles (q = 0.8) and 0-3 mM Ca²⁺
in 20 mM Tris-HCl buffer, pH 7.0. Ca²⁺ was titrated in at a molar ratio relative
to POPG of 0.2 to 0.4. c, Superimposed tyrosine fluorescence emission (TFE)
spectra of CD3εcp under different conditions. The peak value of the
membrane-bound CD3εcp TFE spectrum (red) was much higher than that of
solvent-exposed CD3εcp spectrum (black). Titrating in Ca²⁺ to the
membrane-bound CD3εcp sample substantially reduced the peak value
(orange, green, blue and purple), confirming that Ca²⁺ induced the solvent
exposure of tyrosine aromatic rings. d, The experiments shown in c were
repeated three times, and the increase in the TFE value (at 310 nm) upon lipid
binding, calculated as F(+lipid) − F(−lipid), was plotted as mean ± s.d.
ions are not evenly distributed in T cells and we found that Ca²⁺ micro-
domains co-localized with TCR (Supplementary Fig. 1 and Sup-
plementary Videos 1-3)”. We therefore propose that this divalent cation
could directly change the local electrostatic environment and interfere
with the ionic CD3cp-membrane interaction, thus facilitating ITAM
phosphorylation.

We first performed a microdialysis assay to study whether Ca²⁺
could interfere directly with the CD3cp-membrane interaction. The
addition of Ca²⁺ significantly impaired the specific interaction of
CD3εcp or CD3ζcp with anionic phospholipids, even when the Ca²⁺
to lipid concentration ratio was only 0.1:1 (Fig. 1a, b and Supplementary
Fig. 2). A fluorescence polarization experiment confirmed
this observation (Supplementary Fig. 3). The effect of Ca²⁺ was still
profound at physiological Ca²⁺ concentrations (Supplementary Fig. 4).
We then performed a fluorescence resonance energy transfer (FRET)-
based experiment to measure the effect of Ca²⁺ in live T cells. As
described previously’, the cytoplasmic domain of a monomeric mem-
brane protein, KIR2DL3, was replaced by CD3εcp with a carboxy-
terminal teal fluorescence protein (monomeric TFP1). The distance
between CD3εcp and the plasma membrane was determined by
measuring the FRET efficiency between TFP1 and a membrane dye,
octadecyl rhodamine B (R18). To help translate the FRET efficiency
Figure 3 | Ca²⁺ facilitated CD3 phosphorylation. a, b, An in vitro
phosphorylation assay was performed to study CD3εcp tyrosine
phosphorylation by Lck under different Ca²⁺ concentrations. a, The tyrosine
residues of CD3εcp were efficiently phosphorylated by Lck. The addition of
POPG bicelles but not control POPC bicelles prevented CD3εcp
phosphorylation. Ca²⁺ can reverse this blockade and rescue CD3εcp tyrosine
phosphorylation. pCD3εcp, phosphorylated CD3εcp. b, Ca²⁺ did not directly
increase the activity of Lck for phosphorylating CD3εcp tyrosine residues.
The Ca²⁺ concentration was raised from 0 to 0.5, 5, 50, 500 and 5,000 µM (left
to right). c, Jurkat T cells were stimulated by TCR crosslinking (0.2 µg ml⁻¹
anti-CD3 (UCHT1) plus 0.2 µg ml⁻¹ anti-mouse IgG) for the indicated time at
37 °C. The levels of CD3ε and ζ phosphorylation were substantially greater in
Ca²⁺-containing triggering buffer (1 mM Ca²⁺) compared with Ca²⁺-free
buffer. When intracellular Ca²⁺ was chelated with 10 µM BAPTA, the levels of
both CD3ε and ζ phosphorylation were significantly reduced. d, CRAC-
channel-deficient Jurkat mutant CJ-1 cells only had weak CD3ε and ζ
phosphorylation after TCR crosslinking, indicating that Ca²⁺ influx was
important for CD3 phosphorylation. e, To essentially rule out the potential
contribution of other Ca²⁺-dependent signalling pathways in CD3
phosphorylation, non-physiological Sr²⁺ (0.5 mM) was used instead of Ca²⁺
(1 mM) as the only divalent cation in the triggering buffer. A stable Jurkat cell
line overexpressing constitutively active Lck (Supplementary Fig. 10b) was used
for this experiment. Cells were stimulated using the same condition as in c. The
levels of CD3ε and ζ phosphorylation were substantially greater under Sr²⁺-
containing condition compared with Sr²⁺-free condition, even when Lck
activity was already saturated.
into spatial distance, two control constructs with flexible linkers of
different lengths (3 and 50 amino acids (aa), respectively) between the
KIR2DL3 transmembrane domain and TFP1 were used. The FRET efficiencies of these
two constructs represent the states in which CD3εcp is fully membrane-bound
(3 aa) or unbound (50 aa). In resting cells, CD3εcp-TFP1 showed substantial
FRET efficiency, similar to that of 3 aa-TFP1, indicating that CD3εcp bound to
the plasma membrane. After Ca²⁺ influx induced by ionomycin treatment or TCR
crosslinking, the FRET efficiency of CD3εcp-TFP1 declined significantly to a
level similar to that of 50 aa-TFP1 (Fig. 1e, f and Supplementary Fig. 5). In
contrast, the FRET efficiencies of both 3 aa-TFP1 and 50 aa-TFP1 were not
sensitive to ionomycin treatment (Fig. 1c, d, f). These data support the notion
that Ca²⁺ influx can disrupt the ionic CD3εcp-lipid interaction and result in
the dissociation of CD3εcp from the plasma membrane.
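
One way to read the two linker controls as calibration points is to rescale a
measured FRET efficiency between them. The sketch below assumes a simple linear
interpolation between the 3 aa (bound) and 50 aa (unbound) references; this is
an illustrative choice and not a procedure stated in the text, and the function
name and example efficiencies are hypothetical.

```python
def fraction_membrane_bound(e_sample, e_bound_ref, e_unbound_ref):
    """Rescale a FRET efficiency so that the bound reference (3 aa linker)
    maps to 1.0 and the unbound reference (50 aa linker) maps to 0.0.

    Assumption: linear interpolation between the two controls.
    """
    span = e_bound_ref - e_unbound_ref
    if span <= 0:
        raise ValueError("bound reference must exceed unbound reference")
    fraction = (e_sample - e_unbound_ref) / span
    # Clamp so that measurement noise cannot push the result outside [0, 1].
    return min(1.0, max(0.0, fraction))


# Hypothetical efficiencies: resting cell near the 3 aa control,
# stimulated cell near the 50 aa control.
resting = fraction_membrane_bound(0.28, e_bound_ref=0.30, e_unbound_ref=0.10)
stimulated = fraction_membrane_bound(0.12, e_bound_ref=0.30, e_unbound_ref=0.10)
```

With these made-up numbers the resting value lands close to 1 and the
stimulated value close to 0, mirroring the qualitative result described above.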

We further used solution-state multi-dimensional NMR spectroscopy to trace the
dynamics of the CD3εcp-membrane interaction in response to an increase in Ca²⁺
concentration. As measured by a ¹⁵N-¹H heteronuclear single-quantum coherence
(HSQC) experiment, titrating in Ca²⁺ caused systematic shifts of amide
resonances of all CD3εcp residues, including the key ITAM tyrosines, Tyr 38 and
Tyr 49, from the membrane-bound state to the solution state (Fig. 2a and
Supplementary Fig. 6a). CD3ζcp showed similar sensitivity to an increase
in Ca²⁺ concentration (Supplementary Fig. 7). As one of the major groups
interacting with anionic lipids, the side-chain amine groups of most arginines
in CD3εcp showed intense peaks in the membrane-bound CD3εcp HSQC spectrum.
Those peaks were largely missing in the soluble CD3εcp spectrum because of the
fast proton exchange with the solvent. The addition of Ca²⁺ to the
membrane-bound CD3εcp sample greatly reduced the signal intensity of the
arginine side-chain amine groups, again confirming the ability of Ca²⁺ to
induce the solvent exposure of CD3εcp (Supplementary Fig. 6b). This effect was
not caused by a Ca²⁺-induced CD3εcp conformational change because adding Ca²⁺
to the CD3εcp solution in the absence of bicelles did not result in any
specific chemical shift perturbation (Supplementary Fig. 6c). Our circular
dichroism experiment also verified the effect of Ca²⁺ on disrupting the ionic
CD3εcp-membrane interaction (Supplementary Fig. 8). Upon membrane binding, the
aromatic rings of tyrosines insert into the hydrophobic core of the lipid
bilayer, and substantial nuclear Overhauser effect (NOE) signals (distance
<5 Å) between the tyrosine aromatic protons and the lipid acyl chain protons
were detected (Fig. 2b). Adding Ca²⁺ substantially weakened the tyrosine-lipid
NOE signals, indicating that the tyrosine side chains were no longer embedded
in the membrane core region. A tyrosine fluorescence emission experiment
confirmed the Ca²⁺-induced solvent exposure of ITAM tyrosines (Fig. 2c, d).

We then tested whether Ca²⁺ could facilitate CD3 phosphorylation by both in
vitro phosphorylation (IVP) and live T-cell stimulation experiments. The IVP
data showed that binding to anionic lipids prevented CD3εcp phosphorylation by
Lck, but Ca²⁺ could reverse this blockade and rescue CD3εcp phosphorylation
(Fig. 3a). This is not owing to the enhancement of Lck activity by an increase
in Ca²⁺ concentration (Fig. 3b). The physiological foreign antigen density on
the antigen-presenting cell surface is generally very low. To mimic
Figure 4 | Ca²⁺ bound to the phosphate group of anionic phospholipids at
physiological concentrations. a-d, One-dimensional ³¹P NMR experiments were
performed to probe the local chemical environment change of the phospholipid
phosphate group induced by the binding of Ca²⁺ or CD3εcp peptide. A lipid
nanodisc system was used to provide a membrane bilayer environment without the
interference of detergent molecules. The sample buffer was 20 mM Tris-HCl,
pH 7.0, containing 100 mM NaCl. a, b, Ca²⁺ induced a substantial chemical shift
change of the phosphorus signal of the anionic POPG (a), but not that of the
zwitterionic POPC (b). c, The CD3εcp peptide induced a shift in the same
direction for the POPG phosphorus signal as Ca²⁺ did, implying that both Ca²⁺
and basic residues in CD3εcp bound to the same region in the phospholipid
headgroup via ionic interactions. Mutation of the first three basic residues to
Ala in CD3εcp (CD3εcp mutant) nearly eliminated the perturbation effect of
CD3εcp on the lipid phosphorus signal. d, The effect of Ca²⁺ in perturbing the
³¹P signal was tested at physiological Ca²⁺ concentrations. Even 1 µM Ca²⁺ can
induce unambiguous chemical shift changes of the POPG phosphorus signal. The
POPG concentration used in a and c was 4 mM; in d the concentration was 0.8 mM.
The POPC concentration used in b was 5 mM. The CD3εcp concentration used in c
was 0.08 mM. e, A schematic illustration of the Ca²⁺-induced TCR signalling
amplification model. In the resting state, the positively charged CD3ε/ζ
cytoplasmic domains interact with anionic phospholipids in the inner leaflet of
the plasma membrane and key tyrosine residues are sequestered in the membrane
bilayer, which provides a ‘safety’ control on TCR triggering. It has been well
studied that Ca²⁺ influx starts within a few seconds after the initial TCR
triggering and the influx persists for several hours in synapsed T cells.
Membrane-proximal Ca²⁺ ions can then bind to the phosphate group in anionic
phospholipids and neutralize their negative charges, which results in the
dissociation of CD3ε/ζ cytoplasmic domains from the membrane and increases the
accessibility of ITAM for Lck. Ca²⁺ can thus facilitate the phosphorylation of
TCR-CD3 complexes, especially those contacting low-affinity self antigens or
even bystanders. This feedback regulation of Ca²⁺ can amplify the initial
antigen-stimulated signal to a greater magnitude, which helps explain the
unique nature of the hypersensitivity of T cells to even a single antigen.
Moreover, the persistent high Ca²⁺ concentration in synapsed T cells could help
sustain TCR signalling at a certain level, which is required for the full


physiological conditions, we activated T cells with anti-CD3 antibody at a low
concentration. For both Jurkat and mouse primary T cells, CD3 phosphorylation
was substantially stronger when Ca²⁺ was present in the stimulating buffer
(Fig. 3c and Supplementary Fig. 9a). Chelating intracellular Ca²⁺ with BAPTA
(1,2-bis(o-aminophenoxy)ethane-N,N,N’,N’-tetraacetic acid) significantly
reduced CD3 phosphorylation. CJ-1, a Jurkat mutant cell line having only
residual expression of the CRAC channel, had much weaker CD3 phosphorylation
after T-cell activation when compared with wild-type Jurkat cells (Fig. 3d).
Moreover, the presence of Ca²⁺ in the stimulating buffer did not cause
substantial enhancement of CD3 phosphorylation in CJ-1 cells (Supplementary
Fig. 9b). The effect of Ca²⁺ on CD3 phosphorylation is not caused by the direct
enhancement of Lck activity or the suppression of tyrosine phosphatase
activity. When active Lck was saturated or major tyrosine phosphatases were
inhibited in T cells, the effect of Ca²⁺ on CD3 phosphorylation was still
profound (Supplementary Figs 10 and 11). To further demonstrate that the role
of Ca²⁺ in CD3 phosphorylation is primarily contributed by its charge, not
other Ca²⁺-dependent signalling pathways, we replaced Ca²⁺ with Sr²⁺, a
non-physiological divalent cation that can also pass through the CRAC channel
but is unable to trigger Ca²⁺-dependent signalling pathways in T cells
(Supplementary Fig. 12). Similar to Ca²⁺, Sr²⁺ disrupted the CD3εcp-membrane
interaction and facilitated CD3 phosphorylation (Fig. 3e, Supplementary
Fig. 13).

We next studied the mechanism by which Ca²⁺ undermines the ionic CD3εcp-lipid
interaction. We used ³¹P NMR spectroscopy to probe directly the change in the
chemical environment of the negatively charged lipid phosphorus after Ca²⁺
binding. Lipid nanodiscs were used here instead of bicelles to avoid the signal
interference from the detergent molecule. Ca²⁺ binding induced a substantial
chemical shift change of the phosphorus signal of anionic lipids, but not that
of the zwitterionic POPC (1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine;
Fig. 4a, b and Supplementary Fig. 14a, b). This perturbation effect can be
reversed by the Ca²⁺ chelator EGTA (ethylene glycol tetraacetic acid;
Supplementary Fig. 14c). A shift of the phosphorus signal in the same direction
was induced by CD3εcp binding, indicating that both Ca²⁺ and basic residues in
CD3εcp bound to the same lipid phosphate region (Fig. 4c). The Ca²⁺-lipid
phosphate binding was further confirmed by a Fourier transform infrared
experiment (Supplementary Fig. 15). We then tested the effect of Ca²⁺ at the
low micromolar level to mimic the physiological Ca²⁺ concentration in activated
T cells. An unambiguous perturbation effect on the ³¹P signal was observed even
at 1 µM Ca²⁺, indicating that the binding of Ca²⁺ to the lipid phosphate group
is physiologically relevant (Fig. 4d). This binding thus provides a mechanism
for the role of Ca²⁺ in disrupting the ionic CD3cp-membrane interaction and
facilitating CD3 phosphorylation. As expected, Sr²⁺ can also specifically
perturb the ³¹P signal of anionic lipids (Supplementary Fig. 16).

This study unveils a new function of Ca²⁺ in feedback amplification of initial
antigen-stimulated TCR signalling (Fig. 4e, Supplementary Discussion and
Supplementary Fig. 17). This feedback loop will not cycle infinitely because
the local Ca²⁺ concentration is precisely controlled by the plasma membrane
Ca²⁺ ATPase pumps, mitochondria and other ion channels. Moreover, TCRs become
internalised and degraded when they move to the central immunological synapse.
The Ca²⁺-mediated signal amplification process helps explain the unique nature
of the hypersensitivity of T cells to even a single antigen molecule. Ca²⁺
should also help sustain TCR signalling because the elevation of Ca²⁺
concentration persists for several hours in synapsed T cells. Future work will
be needed to fully demonstrate the in vivo relevance of our model and its
potential applications to other signalling pathways involving ionic
protein-lipid interactions.


METHODS SUMMARY 


Mouse primary CD4⁺ T cells were sorted from C57/B6 mouse splenocytes using
CD4⁺ T cell Microbeads from Miltenyi Biotec. A CRAC-deficient Jurkat cell line
CJ-1 was generously provided by R. Lewis. Jurkat and mouse CD4⁺ primary T cells
were stimulated by anti-CD3 crosslinking (for Jurkat cells, 0.2 µg ml⁻¹ UCHT1
plus 0.2 µg ml⁻¹ anti-mouse IgG; for primary T cells, 0.5 µg ml⁻¹ 145-2C11 plus
1 µg ml⁻¹ anti-hamster IgG) for the indicated time in Mg²⁺-free Ringer’s
buffer with or without 1 mM Ca²⁺. To chelate intracellular Ca²⁺, T cells were
pretreated with 10 µM BAPTA-AM (Invitrogen) for 30 min at 37 °C before antibody
stimulation. CD3 was immunoprecipitated with UCHT1 or 145-2C11 and analysed to
determine its tyrosine phosphorylation status by anti-pY100 immunoblotting
(Cell Signaling Technology). The tyrosine phosphorylation of CD3ζ was detected
by anti-pCD3ζ (Santa Cruz Biotechnology) immunoblotting of cell lysate samples.

All NMR spectra were acquired at 27 °C on Bruker Avance 600 MHz and 850 MHz
spectrometers. The acquired data were further processed using the software
package NMRPipe and analysed with the program Sparky (Goddard & Kneller,
University of California, San Francisco, California, USA). Due to the low
concentration of protein/lipid/Ca²⁺ mixture samples, two-dimensional proton
NOESY experiments were conducted instead of three-dimensional ¹³C-separated
NOESY experiments to observe the NOEs from CD3εcp aromatic protons to lipid
methyl protons. The binding of Ca²⁺ to the phosphate group of the phospholipid
was probed by a one-dimensional ³¹P NMR measurement on the 850 MHz
spectrometer equipped with a ³¹P direct-detection channel. Nanodiscs composed
of different phospholipids
(1-palmitoyl-2-oleoyl-sn-glycero-3-phospho-(1’-rac-glycerol) (POPG),
1-palmitoyl-2-oleoyl-sn-glycero-3-phospho-L-serine (POPS) or POPC) were used
for the ³¹P experiment to eliminate the interference of the phosphorus signal
from the detergent molecule that is commonly used in other lipid bilayer
systems. ³¹P spectra were measured with 256 accumulated transients.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Cells and reagents. Mouse CD4⁺ T cells were sorted from C57/B6 mouse
splenocytes using CD4⁺ T cell selection microbeads (Miltenyi Biotec). A
CRAC-deficient Jurkat cell line, CJ-1, was provided by R. Lewis. Lipids and
detergent (POPG, POPC, POPS, DHPC (dihexanoyl-phosphatidylcholine)) were from
Avanti Polar Lipids. Anti-CD3ε (M20), -phospho-CD3ζ, -CD3ζ (6B10.2) and
-glutathione-S-transferase (GST) (Z-5) antibodies were from Santa Cruz
Biotechnology. Phospho-Y100 antibody was from Cell Signaling Technology,
UCHT1 from eBioscience, 145-2C11 from Biolegend, HA tag antibody (3F10)
from Roche. Isotopes for NMR experiments were from Cambridge Isotope
Laboratories.

Preparation of antibody-coated chambered coverglass slides. Lab-Tek chambered
coverglass slides (Thermo Fisher Scientific) were cleaned with a strip buffer
(H₂SO₄:H₂O₂ volume ratio 7:3), washed, and dried. 200 µl anti-human CD3
antibody UCHT1 (40 µg ml⁻¹ in PBS) was loaded onto the coverglass slides and
incubated at 37 °C for 30 min. After mild washing with PBS, the coverglass
slides were blocked with 5% BSA in Ringer’s buffer at 37 °C for 2 h and then
mildly washed with Ringer’s buffer before use.

High-resolution high-speed time-lapse live cell imaging by total internal
reflection fluorescence microscopy (TIRFM) and spinning-disk confocal
microscopy. GCaMP5, a new generation of genetically encoded Ca²⁺ indicator
(Looger Lab, Janelia Farm, HHMI), was used to image Ca²⁺ in live T cells.
GCaMP5- or HA-human CD3ε-GCaMP5-expressing Jurkat transfectants were first
loaded with 100 µM EGTA-AM (a membrane-permeable form of EGTA; Invitrogen) and
resuspended in Ringer’s buffer. EGTA was not used for the spinning-disk
confocal imaging of HA-hCD3ε-GCaMP5-expressing cells. Cells were placed on
antibody-coated coverglass slides right before imaging. TIRF images were
acquired at 37 °C on a heated stage by an Olympus IX-81 microscope supported by
a TIRF port, an ANDOR iXon+ DU897D electron-multiplying charge-coupled device
(EMCCD) camera (ANDOR Technology) and Olympus ×100 1.45 numerical aperture (NA)
lenses. The acquisition was controlled by MetaMorph software (Molecular
Devices). The exposure time was 100 ms unless otherwise indicated. A Coherent
Sapphire 488 nm diode laser was used. Spinning-disk confocal images were
acquired by an Olympus IX-81 microscope supported by the Yokogawa spinning-disk
sets, an ANDOR iXon+ DU897D EMCCD and Olympus ×100 1.45 NA lenses. The
acquisition was controlled by ANDOR Q2 software (ANDOR Technology). The
exposure time and laser were the same as for the TIRF set-up above. Images were
analysed with Image Pro Plus software (Media Cybernetics).

CD3εcp and CD3ζcp peptide expression and labelling. CD3εcp and CD3ζcp were
expressed as GST fusion proteins in Escherichia coli, separated from GST by
tobacco etch virus protease cleavage and purified by reverse-phase HPLC. For
microdialysis and fluorescence polarization experiments, peptides were
expressed with an additional amino-terminal cysteine and labelled with an
Alexa488-maleimide derivative (Invitrogen). For the in vitro phosphorylation
experiment, the GST-CD3εcp fusion protein was used directly for the unbiased
detection of phosphorylated and unphosphorylated CD3εcp by anti-GST
immunoblotting.
Equilibrium-based microdialysis assay. The microdialysis system was composed
of two equal-size chambers, an experimental chamber and a control chamber. The
experimental chamber was loaded with 10 nM Alexa488-labelled CD3εcp or CD3ζcp
and large unilamellar vesicles (LUVs) composed of 5 mM POPG (an anionic
phospholipid) in buffer containing 20 mM HEPES, pH 7.0 and 100 mM KCl, whereas
the control chamber was loaded with 10 nM Alexa488-labelled CD3εcp or CD3ζcp
and LUVs composed of 5 mM POPC (a zwitterionic phospholipid) in the same
buffer. Ca²⁺ was added into the two chambers at a molar ratio relative to the
phospholipid from 0 to 1. The two chambers were separated by a 300-kDa dialysis
membrane, which allowed the free movement of the fluorescent peptides but not
the LUVs. After reaching equilibrium at 16 h, the binding of CD3εcp or CD3ζcp
to the anionic phospholipid was calculated on the basis of the fluorescence
intensity difference between the two chambers. The binding efficiency of CD3εcp
or CD3ζcp to acidic phospholipids was calculated by the following equation:
Binding (%) = (FLExp − FLCon)/(FLExp + FLCon) × 100, where FLExp is the
fluorescence intensity in the experimental chamber and FLCon is the
fluorescence intensity in the control chamber. The binding efficiencies under
different Ca²⁺ concentrations were then normalized to the value measured in the
absence of Ca²⁺.
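
The binding equation and the normalization step above can be sketched in a few
lines. This is a minimal illustration, assuming the fluorescence intensities
are already background-corrected; the function names and the example readings
are hypothetical and not taken from the paper.

```python
def binding_percent(fl_exp, fl_con):
    """Binding (%) = (FLExp - FLCon) / (FLExp + FLCon) * 100."""
    total = fl_exp + fl_con
    if total <= 0:
        raise ValueError("total fluorescence must be positive")
    return (fl_exp - fl_con) / total * 100.0


def normalize_to_calcium_free(bindings):
    """Divide every binding value by the first one, which is assumed to be
    the measurement taken in the absence of Ca2+."""
    baseline = bindings[0]
    if baseline == 0:
        raise ValueError("Ca2+-free binding must be non-zero")
    return [b / baseline for b in bindings]


# Hypothetical (FLExp, FLCon) readings at increasing Ca2+:lipid ratios,
# starting with the Ca2+-free sample.
readings = [(150.0, 50.0), (125.0, 75.0), (105.0, 95.0)]
bindings = [binding_percent(e, c) for e, c in readings]
relative = normalize_to_calcium_free(bindings)
```

The first entry of `relative` is 1 by construction, so later entries read
directly as the fraction of Ca²⁺-free binding that remains.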

To study the effect of Ca²⁺ on CD3εcp-lipid binding at physiological
micromolar Ca²⁺ concentrations (Supplementary Fig. 4), we optimized the system
and used 20 µM POPG or POPC LUV, 2 nM Alexa488-CD3εcp peptide and 0-20 µM
Ca²⁺ in 20 mM HEPES buffer, pH 7.0, containing 150 mM KCl.

Fluorescence polarization assay. To measure the binding kinetics of CD3εcp to
phospholipids and the inhibition of this binding by Ca²⁺, Alexa488-labelled
CD3εcp, POPG/DHPC bicelles (q = 0.8) and Ca²⁺ were incubated in a 384-well
plate for 15 min at 27 °C. The sample buffer was 20 mM HEPES, pH 7.0, 150 mM
NaCl. Fluorescence polarization values were measured using a Tecan Infinite
M1000Pro Microplate Reader.

Tyrosine fluorescence emission assay. The tyrosine fluorescence emission
spectrum of CD3εcp was recorded on a Varian Cary Eclipse instrument, with an
excitation wavelength of 275 nm and an emission wavelength range of
290-390 nm. Mixture bicelles (30% POPS, 10% POPG and 60% POPC) were used to
mimic the physiological lipid composition in the inner leaflet of the plasma
membrane. Tyrosine fluorescence emission experiments were performed with 6 µM
CD3εcp, 0.1 mM DTT, 0 or 0.5 mM POPS/POPG/POPC mixture bicelles (q = 0.8) and
10 mM Tris-HCl, pH 7.4. Ca²⁺ was titrated at a molar ratio relative to the
total lipid concentration of 0.04 to 0.4.

FRET measurement. One million cells were resuspended in 1 ml of Mg²⁺-free
Ringer’s buffer (155 mM NaCl, 4.5 mM KCl, 10 mM D-glucose, 10 mM HEPES,
pH 7.4) containing 2 mM CaCl₂. A 300-µl aliquot was pipetted onto the centre of
a 35 mm glass-bottom dish (Shengyou Biotechnology) and cells were allowed to
adhere for 5 min at room temperature. To induce Ca²⁺ influx, 5 µM ionomycin
(Sigma) or anti-CD3 antibody (2 µg ml⁻¹ UCHT1 and 2 µg ml⁻¹ anti-mouse IgG)
was added into the dish, which was then incubated at room temperature for an
additional 5 min. To label the plasma membrane, 0.5 µl of 500 µg ml⁻¹ octadecyl
rhodamine B (R18) (Invitrogen) was added into the dish. The dish was then
mounted onto a Leica TCS SP5 microscope for imaging. The FRET efficiency
between TFP1 (donor) and R18 (acceptor) was measured by the dequenching
method. TFP1 was excited with the Argon 458 nm laser line and visualized using
detection at 470-560 nm. R18 was excited with the Helium/Neon 561 nm laser
line and visualized using detection at 590-650 nm. An unpaired two-tailed
Student’s t-test was used for the statistical analysis of all FRET data.
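
The text does not spell out the dequenching calculation. A common definition,
assumed here for illustration only, compares the donor (TFP1) intensity before
and after the acceptor (R18) is photobleached: E = 1 − F_quenched/F_dequenched.
The function name and the example intensities are hypothetical.

```python
def fret_efficiency_dequenching(donor_before, donor_after):
    """FRET efficiency from donor dequenching.

    donor_before: donor intensity with the acceptor intact (quenched).
    donor_after:  donor intensity after acceptor photobleaching (dequenched).

    Assumption: E = 1 - donor_before / donor_after, the usual
    acceptor-photobleaching definition; the paper may differ in detail.
    """
    if donor_after <= 0:
        raise ValueError("dequenched donor intensity must be positive")
    efficiency = 1.0 - donor_before / donor_after
    # Negative values can only arise from noise or bleaching of the donor;
    # report them as zero transfer.
    return max(0.0, efficiency)


# Hypothetical intensities for a single cell.
example = fret_efficiency_dequenching(donor_before=80.0, donor_after=100.0)
```

In this made-up case the donor recovers from 80 to 100 units, which
corresponds to a transfer efficiency of about 0.2.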

In vitro phosphorylation assay. CD3εcp with an N-terminal GST tag was used in
the in vitro phosphorylation assay, for the unbiased detection of
phosphorylated and unphosphorylated CD3εcp peptides by anti-GST immunoblotting.
GST-CD3εcp (1 µM) was first incubated with lipids (2 mM POPG or POPC bicelles)
in the presence or absence of 10 mM Ca²⁺ at 37 °C for 30 min. The reaction
volume was 20 µl and the buffer was 20 mM HEPES, pH 7.0, 150 mM KCl. Lck
(Millipore) and kinase buffer were then added to the reaction and the final
volume was 40 µl, containing 0.5 µM CD3εcp, 1 mM lipid bicelles, 2 µg ml⁻¹ Lck,
0 or 5 mM Ca²⁺, 70 mM HEPES, pH 7.0, 5 mM MgCl₂, 3 µM Na₃VO₄, 1.25 mM DTT,
150 mM KCl and 0.2 mM ATP. The reaction was incubated at 30 °C for 1 h.
Tyrosine-phosphorylated and total CD3εcp were detected by anti-pY100 and
anti-GST immunoblotting, respectively.
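
The final concentrations quoted above follow from doubling the reaction volume
from 20 µl to 40 µl. The short check below applies C1·V1 = C2·V2 to the three
components carried over from the pre-incubation; the helper name is
hypothetical and the snippet is only a worked arithmetic example.

```python
def diluted_concentration(initial_conc, initial_volume_ul, final_volume_ul):
    """Concentration after dilution, from C1 * V1 = C2 * V2."""
    if final_volume_ul <= 0:
        raise ValueError("final volume must be positive")
    return initial_conc * initial_volume_ul / final_volume_ul


# Pre-incubation values (20 ul) taken from the text; units as labelled.
pre_incubation = {
    "GST-CD3ecp (uM)": 1.0,
    "lipid bicelles (mM)": 2.0,
    "Ca2+ (mM)": 10.0,
}
final = {
    name: diluted_concentration(conc, 20.0, 40.0)
    for name, conc in pre_incubation.items()
}
```

The resulting values (0.5 µM peptide, 1 mM lipid, 5 mM Ca²⁺) match the final
reaction composition listed in the protocol.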

To test whether an increase in Ca²⁺ concentration could directly enhance Lck
activity, Lck (2 µg ml⁻¹) was incubated with CD3εcp (0.5 µM) and Ca²⁺ (0-5 mM)
at 30 °C for 1 h to phosphorylate CD3εcp tyrosine residues.

T-cell activation and immunoblotting. Jurkat cells and mouse CD4⁺ primary T
cells were stimulated at 37 °C by TCR crosslinking (for Jurkat cells,
0.2 µg ml⁻¹ anti-CD3 (UCHT1) plus 0.2 µg ml⁻¹ anti-mouse IgG; for primary T
cells, 0.5 µg ml⁻¹ anti-CD3 (145-2C11) plus 1 µg ml⁻¹ anti-hamster IgG) for the
indicated time in Mg²⁺-free Ringer’s buffer with or without 1 mM Ca²⁺. To
chelate intracellular Ca²⁺, the cells were pretreated with 10 µM BAPTA-AM
(Invitrogen) for 30 min at 37 °C. Stimulation reactions were terminated by
adding ice-cold Mg²⁺-free Ringer’s buffer. For the detection of CD3ε
phosphorylation, cells were lysed in an immunoprecipitation buffer (1% NP40,
50 mM Tris-HCl, pH 7.4, 155 mM NaCl, 2 mM EDTA, 2 mM Na₃VO₄, 20 mM NaF, 10 mM
iodoacetamide, 1 mM PMSF and complete protease inhibitor cocktail (Sigma)) and
then subjected to CD3ε immunoprecipitation by UCHT1 or 145-2C11. For the
detection of CD3ζ phosphorylation, stimulated cells were directly lysed in
SDS-PAGE sample buffer. Immunoprecipitated samples and cell lysate samples were
separated by SDS-PAGE, transferred onto PVDF membranes (Millipore) and
subjected to immunoblotting. CD3ε phosphorylation and CD3ζ phosphorylation were
detected by anti-pY100 and anti-pCD3ζ immunoblotting, respectively. Membranes
were then stripped and re-blotted with anti-CD3ε or anti-CD3ζ (6B10.2) for the
detection of total CD3ε and CD3ζ, respectively.

HA crosslinking. To study how receptor aggregation affects the ITAM-membrane
interaction, we generated a chimaera construct, HA-KIR-CD3ζcp-TFP1, which
contains an N-terminal HA tag, KIR2DL3 extracellular and transmembrane
domains, the CD3ζ cytoplasmic domain and a C-terminal monomeric TFP1. Stable
Jurkat cell lines expressing this construct were generated by lentiviral
infection and FACS sorting. For FRET measurement, one million Jurkat
transfectants were resuspended in 1 ml Ca²⁺- and Mg²⁺-free Ringer’s buffer. A
300 µl cell suspension aliquot was pipetted onto the centre of a 35 mm
glass-bottom dish and cells were allowed to adhere for 5 min at room
temperature. To crosslink the monomeric CD3ζ chimaera, 3 µl of a mixture of
20 µg ml⁻¹ rat anti-HA antibody and 20 µg ml⁻¹ anti-rat IgG was added into the
dish and incubated at room temperature for an additional 5 min. FRET
measurements were performed as described above. For the chimaera
phosphorylation experiment, Jurkat transfectants were stimulated at 37 °C by
anti-HA crosslinking (5 µg ml⁻¹ anti-HA (3F10, Roche) + 10 µg ml⁻¹ goat
anti-rat IgG) for the indicated times in Ca²⁺- and Mg²⁺-free Ringer’s buffer.
Cells were then lysed and the chimaera was pulled down by anti-HA antibody
immunoprecipitation. Immunoprecipitated samples were separated by SDS-PAGE,
transferred onto PVDF membrane and subjected to immunoblotting. CD3ζcp
phosphorylation was detected by anti-pCD3ζ immunoblotting (pY142, Anbo
biotech). Membranes were then stripped and re-blotted with anti-HA for the
detection of total HA-KIR-CD3ζcp-TFP1.

Preparation of nanodiscs, bicelles and LUVs. To assemble nanodiscs, the
MSP1D1/lipid/sodium cholate reaction mixture was incubated for 15 h at the
appropriate temperature (POPC 4 °C, POPG/POPS 14 °C) in buffer containing
20 mM Tris-HCl, pH 7.2, 100 mM NaCl and 0.5 mM EDTA. The molar ratio of
MSP1D1:lipid:sodium cholate was 1:65:130. The nanodisc self-assembly process
was initiated upon the removal of the sodium cholate by incubating the mixture
with Bio-Beads SM-2 (0.8 g per ml reaction mixture, Bio-Rad) on a shaker for
8 h at room temperature. After filtering off the Bio-Beads with syringe
filters, the nanodisc samples were purified on a Superdex 200 gel-filtration
column (GE Healthcare) equilibrated in buffer containing 20 mM Tris-HCl,
pH 7.0 and 100 mM NaCl. The peak fractions were collected and concentrated
using an ultracentrifugation column with a 30,000 MW cutoff (Millipore). Large
unilamellar vesicle and bicelle samples were prepared as previously described.
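
The 1:65:130 molar ratio and the Bio-Beads loading translate into component
amounts as sketched below. The MSP1D1 concentration and reaction volume in the
example are hypothetical inputs, since the text gives only the ratio and the
bead mass per millilitre; the function name is likewise illustrative.

```python
# Molar ratio MSP1D1 : lipid : sodium cholate, from the protocol.
RATIO_MSP, RATIO_LIPID, RATIO_CHOLATE = 1, 65, 130
BIOBEADS_G_PER_ML = 0.8  # grams of Bio-Beads SM-2 per ml of reaction mixture


def nanodisc_mixture(msp_mm, volume_ml):
    """Return lipid and cholate concentrations (mM) and Bio-Beads mass (g)
    for a given MSP1D1 concentration (mM) and reaction volume (ml)."""
    if msp_mm <= 0 or volume_ml <= 0:
        raise ValueError("concentration and volume must be positive")
    return {
        "lipid_mM": msp_mm * RATIO_LIPID / RATIO_MSP,
        "cholate_mM": msp_mm * RATIO_CHOLATE / RATIO_MSP,
        "biobeads_g": BIOBEADS_G_PER_ML * volume_ml,
    }


# Hypothetical reaction: 0.1 mM MSP1D1 in 2 ml.
mix = nanodisc_mixture(msp_mm=0.1, volume_ml=2.0)
```

For this made-up reaction the sketch gives roughly 6.5 mM lipid, 13 mM sodium
cholate and 1.6 g of Bio-Beads.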
NMR experiments. All NMR spectra were acquired at 27 °C on Bruker Avance
600 MHz and 850 MHz spectrometers. The acquired data were further processed
using the software package NMRPipe³¹ and analysed with Sparky (Goddard &
Kneller, University of California, San Francisco, California, USA).

Aromatic NOESY. Because of the low concentration of the protein/lipid/Ca²⁺
mixture samples, two-dimensional proton NOESY experiments were conducted
to observe NOEs from CD3εcp aromatic protons to lipid methyl protons. For the
indirect dimension, background signals from CD3εcp were suppressed by double
¹³C filtering, and only lipid methyl protons evolved. For the direct dimension,
the aromatic protons Hδ and Hε from tyrosines were selected by aromatic ¹³C
editing. The samples were diluted in D₂O to achieve better water suppression
and reduce the spin diffusion effect. Experiments were conducted on the Bruker
600 MHz spectrometer at 27 °C with an NOE mixing time of 300 ms, 128 points in
the indirect dimension, 2,048 points in the direct dimension and 1,024
accumulated transients.

³¹P spectrum. The binding of Ca²⁺ to the phosphate group of the phospholipid
was measured by ³¹P spectra on the 850 MHz spectrometer equipped with a ³¹P
direct-detection channel. Lipid nanodisc samples were prepared in buffer
containing 20 mM Tris-HCl, pH 7.0, 100-150 mM NaCl. Ca²⁺ was titrated in with a
molar ratio of Ca²⁺:lipid from 0 to 1.5. One-dimensional ³¹P spectra were
measured with 256 accumulated transients and processed with 25 Hz of line
broadening.
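
Line broadening is conventionally applied by multiplying the free induction
decay by a decaying exponential before Fourier transformation. The sketch below
assumes the usual window w(t) = exp(−π·LB·t) with LB = 25 Hz; it is a generic
illustration in plain Python, not the NMRPipe processing script used for the
paper, and the dwell time and toy signal are hypothetical.

```python
import math


def exponential_window(n_points, dwell_time_s, lb_hz=25.0):
    """Exponential apodization weights w(t) = exp(-pi * LB * t)."""
    return [math.exp(-math.pi * lb_hz * i * dwell_time_s) for i in range(n_points)]


def apply_line_broadening(fid, dwell_time_s, lb_hz=25.0):
    """Multiply each FID point (real or complex) by the exponential window."""
    weights = exponential_window(len(fid), dwell_time_s, lb_hz)
    return [point * w for point, w in zip(fid, weights)]


# Toy FID: a constant complex signal sampled every 0.1 ms (hypothetical).
toy_fid = [complex(1.0, 0.0)] * 8
broadened = apply_line_broadening(toy_fid, dwell_time_s=1e-4, lb_hz=25.0)
```

A larger LB suppresses noise in the tail of the decay at the cost of wider
peaks, which is the usual trade-off when the signal is weak.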


31. Delaglio, F. et al. NMRPipe: a multidimensional spectral processing system
based on UNIX pipes. J. Biomol. NMR 6, 277-293 (1995).
Coordinated control of replication and transcription
by a SAPK protects genomic integrity


Alba Duch¹, Irene Felipe-Abrio², Sonia Barroso², Gilad Yaakov¹, Maria Garcia-Rubio², Andrés Aguilera², Eulalia de Nadal¹
& Francesc Posas¹


Upon environmental changes or extracellular signals, cells are subjected to
marked changes in gene expression. Dealing with high levels of transcription
during replication is critical to prevent collisions between the transcription
and replication pathways and avoid recombination events. In response to
osmostress, hundreds of stress-responsive genes are rapidly induced by the
stress-activated protein kinase (SAPK) Hog1 (ref. 6), even during S phase. Here
we show in Saccharomyces cerevisiae that a single signalling molecule, Hog1,
coordinates both replication and transcription upon osmostress. Hog1 interacts
with and phosphorylates Mrc1, a component of the replication complex.
Phosphorylation occurs at different sites to those targeted by Mec1 upon DNA
damage. Mrc1 phosphorylation by Hog1 delays early and late origin firing by
preventing Cdc45 loading, as well as slowing down replication-complex
progression. Regulation of Mrc1 by Hog1 is completely independent of Mec1 and
Rad53. Cells carrying a non-phosphorylatable allele of MRC1 (mrc1³ᴬ) do not
delay replication upon stress and show a marked increase in
transcription-associated recombination, genomic instability and Rad52 foci. In
contrast, mrc1³ᴬ induces Rad53 and survival in the presence of hydroxyurea or
methyl methanesulphonate. Therefore, Hog1 and Mrc1 define a novel S-phase
checkpoint independent of the DNA-damage checkpoint that permits eukaryotic
cells to prevent conflicts between DNA replication and transcription, which
would otherwise lead to genomic instability when both phenomena are temporally
coincident.

High levels of transcription can induce genomic instability owing to
conflicts between transcription and replication. SAPKs are key elements in
intracellular signalling networks that serve to respond and adapt to
extracellular changes. Exposure of yeast to high osmolarity activates the
p38-related SAPK Hog1, which reprograms the expression capacity of the cell
and delays cell-cycle progression, raising the question of how cells make
massive changes in gene expression compatible with DNA replication.

To identify molecular targets for Hog1 in the replication complex (RC), a
subset of 30 proteins involved in replication were subjected to an in vitro
phosphorylation assay with active Hog1. Only Mrc1, which couples the DNA
helicase and the DNA polymerase during replication, was phosphorylated by Hog1
(Fig. 1a). Mrc1 was also phosphorylated in vivo upon osmostress in a
Hog1-dependent manner (Fig. 1b). Mrc1 has three MAPK consensus sites (T169,
S215 and S229), and mutations of all three sites mostly abolished
phosphorylation by Hog1 (Fig. 1a, b and Supplementary Fig. 2). These
phosphorylation sites are different to those involved in the intra-S-phase DNA
checkpoint. Then, we tested whether endogenous tagged Hog1 and Mrc1
interacted. Notably, Mrc1 precipitated with Hog1 and vice versa (Fig. 1c and
Supplementary Fig. 3a). This interaction was confirmed by a two-hybrid assay
(Supplementary Fig. 3b, c). These results indicate that Mrc1 is a bona fide
target of Hog1.

To assess the relevance of Mrc1 and its phosphorylation by Hog1 in the cell
cycle, cells were synchronized at the onset of S phase by using a
temperature-sensitive CDC7 allele (cdc7-4) after release from pheromone, and
then released with or without osmostress (NaCl or sorbitol). S-phase progression
was delayed in osmostressed cells (Fig. 2a) but the delay was not observed in
mrc1^3A cells (Fig. 2a). Similar results were obtained after release from
α-factor or cdc7-4 synchronization (Supplementary Fig. 4a, b). Hog1 can be
activated by the Pbs2 MAPKK (Pbs2^EE). Overexpression of Pbs2^EE also induced an
S-phase delay in cdc7-4 cells that was abolished in mrc1^3A cells (Supplementary
Fig. 5a, b). Clb5 is degraded after cells exit S phase. Correspondingly,
wild-type (cdc7-4) cells subjected to osmostress delayed Clb5 degradation, in
contrast to mrc1^3A cells (Fig. 2b). Of note, mec1- or rad53-deficient cells
delayed S-phase progression upon osmostress as efficiently as wild-type cells
(Supplementary Fig. 6a). In addition, Rad53 was not phosphorylated upon
osmostress, in contrast to treatment with hydroxyurea (HU) (Supplementary
Fig. 6b). This indicates that osmostress delays S phase independently of the
S-phase checkpoint that responds to DNA damage. Correspondingly, cells
containing the mrc1^3A allele blocked S-phase progression in response to HU,
phosphorylated Rad53 and were resistant to HU and methyl methanesulphonate
(MMS), showing an intact DNA-damage checkpoint (Supplementary Fig. 7).
Therefore, phosphorylation of Mrc1 by Hog1 is essential for the S-phase delay
upon osmostress but it is not required for the DNA-damage response.

Figure 1 | Mrc1 is a Hog1 target. a, Hog1 phosphorylates Mrc1 in vitro.
Purified glutathione S-transferase (GST)-Mrc1 protein is phosphorylated by Hog1
in an in vitro kinase assay, and this phosphorylation is abolished in
GST-Mrc1^3A, which harbours mutations of Thr 169, Ser 215 and Ser 229 to Ala.
WT, wild type. b, Mrc1 is a Hog1 target in vivo. Mrc1-TAP pulled down from
S-phase-synchronized cultures is phosphorylated by Hog1 in osmostressed cells.
Con, control; IP, immunoprecipitate. c, Hog1 and Mrc1 interact in vivo. Mrc1-TAP
co-immunoprecipitates with Hog1-haemagglutinin (HA) and vice versa in in vivo
pull-down assays.

Figure 2 | mrc1^3A bypasses the osmostress-induced S phase delay. a, b, cdc7-4
(MRC1 or mrc1^3A) cells were pre-synchronized with α-factor (α), washed and
shifted to 37 °C to synchronize them at the onset of S phase (37 °C). Cells were
released at 25 °C in yeast extract peptone dextrose (YPD) (control), NaCl or
sorbitol, and cell-cycle progression was analysed by fluorescence-activated cell
sorting (FACS) (a) or by western blot (b). a, mrc1^3A cells cannot delay DNA
replication upon osmostress. Vertical dotted line indicates the end of S phase
(2C peak). The percentage of cells in S phase is shown at the top right of
graphs. b, mrc1^3A cells show no delay in Clb5 degradation after release upon
osmostress (NaCl). Bottom panels show normalized Clb5 quantification. Data
represent the mean and standard deviation (s.d.) of three independent
experiments.

To characterize the mechanism by which Hog1 phosphorylation of Mrc1 delays S
phase, we followed the binding of endogenous tagged Dpb2 at replication origins
and adjacent regions. The Dpb2 subunit of DNA polymerase ε serves to monitor
origin activation and RC progression. Dpb2 binding at the early origin ARS305
was similar in control and stressed cells, but was clearly delayed in adjacent
regions upon osmostress (8 kilobase pairs (kb) and 17 kb). This delay was barely
present in mrc1^3A cells (Fig. 3a and Supplementary Fig. 8). Additionally,
recruitment of Dpb2 to the late origin ARS501 was delayed in wild-type but not
in mutant cells (Fig. 3a and Supplementary Fig. 8). Similar results were
obtained when loading of Mrc1 or Pol2 was assessed (Supplementary Figs 9 and
10a).

We then analysed replication-fork progression by two-dimensional gel
electrophoresis. Replication through the DNA region adjacent to ARS305 lasted
longer in wild-type cells when compared to mrc1^3A cells, with the Y-arc
persisting in the wild type while fading away in the mutant (Fig. 3b). Thus,
Hog1 phosphorylation of Mrc1 delays fork progression upon stress. It has been
suggested that the altered interaction between Mrc1 and the Pol2 subunit of DNA
Pol ε delays S-phase progression upon replication stress. Binding of Mrc1 to
Pol2 was reduced to less than 50% upon osmostress. In contrast, the association
of Pol2 with the non-phosphorylatable mutant Mrc1^3A protein was not altered
(Fig. 3c and Supplementary Fig. 11a). The association of Dpb2 with Mrc1 or Pol2
was also not altered (Supplementary Fig. 11b). Therefore, osmostress leads to
reduced binding of Mrc1 to DNA Pol2, which might explain a decrease in RC
progression.

The quantification of the two-dimensional gels also suggested that firing of
replication at ARS305, as indicated by the appearance of the bubble arc, was
slightly delayed in the presence of stress in wild-type cells, in contrast to
mrc1^3A cells. We then performed DNA-combing experiments using synchronous
cultures to measure replication speed and RC progression (track length).
Wild-type cells showed significantly shorter replication tracks upon osmostress
(30 min) than mrc1^3A cells, suggesting that the latter were not able to delay
early origin firing and RC progression (Fig. 3d). Similar results were observed
in asynchronous cultures (Supplementary Fig. 12a). In addition, assays performed
at different time points (30 and 45 min) (Fig. 3d and Supplementary Fig. 12b)
showed that wild-type cells had a reduced speed of replication upon osmostress
when compared to mrc1^3A cells (0.3 kb min⁻¹ versus 0.57 kb min⁻¹). Loading of
the Cdc45 helicase is essential for origin firing. Remarkably, binding of Cdc45
to early origins was delayed in wild-type cells and in rad53 cells, but not in
mrc1^3A cells, upon osmostress (Fig. 3e and Supplementary Fig. 10c). Thus, Hog1
phosphorylation of Mrc1 delays origin firing by preventing the association of
Cdc45 with early origins. In contrast, HU does not alter the association of
Cdc45 with early origins, which is consistent with HU not preventing early
origin firing (Supplementary Fig. 10b). Although it is possible that the
phosphorylation of Mrc1 by Hog1 could lead to recruitment of additional factors
to regulate origin firing, Mrc1 phosphorylation upon osmostress delays S-phase
progression by altering early and late origin firing and possibly inhibiting
fork progression.

We next asked how important it was for the cell to delay S-phase progression
upon osmostress. To do this, we assayed chromosomal instability upon osmostress
in wild-type or mrc1^3A cells by a sectoring colour assay (Methods). mrc1^3A
cells showed a marked increase in chromosomal instability in response to
osmostress that was not observed in wild type (Fig. 4a and Supplementary
Fig. 13a). Correspondingly, an MRC1 mutant carrying Hog1-phosphomimic amino
acids (mrc1^3D) showed slower S-phase progression even in the absence of stress
(Supplementary Fig. 14a) and slower replication upon osmostress (Supplementary
Fig. 14b), and did not have a significant increase in genomic instability upon
osmostress, indicating that Mrc1 phosphorylation by Hog1 is essential to
preserve genomic stability upon stress (Supplementary Fig. 13a).

High levels of transcription can induce genomic instability owing to collisions
between transcription and replication complexes, and a major adaptive response
to osmostress is the control of gene expression by Hog1 (ref. 6). It was
therefore conceivable that concurrent transcription upon osmostress with ongoing
replication could be provoking transcription-associated recombination (TAR).
Thus, we asked whether TAR increased upon osmostress. A plasmid in which
transcription of a leu2 direct-repeat recombination system was driven by the
STL1 stress-responsive promoter in two different orientations (IN and OUT) with
respect to the ARS209 (ARSH4) origin allowed us to assess recombination of the
leu2 direct repeat (Fig. 4b, Supplementary Fig. 13b and Methods). In the absence
of stress, neither the wild-type nor the mrc1^3A cells showed any TAR. However,
upon stress, TAR was strongly induced only in mrc1^3A cells when transcription
and replication progressed towards each other (IN). Of note, cells deficient in
the HOT1 transcription factor, which drives the induction of STL1, did not show
TAR in mrc1^3A cells (Supplementary Fig. 13c). Therefore, phosphorylation of
Mrc1 is a key event to prevent TAR. Correspondingly, osmostressed mrc1^3A cells
showed an increase in Rad52 foci when compared to wild type (Fig. 4c), as
expected for an increase in Rad52-engaged recombination events. This
demonstrates that phosphorylation of Mrc1 by Hog1 is important for reducing
recombination events at genomic loci.

The genomic instability in mrc1^3A cells upon osmostress does not render cells
osmosensitive. Therefore, there is probably a mechanism that prevents the lethal
accumulation of the molecular events responsible for genomic instability.
Deletion of RAD53 did not affect cell growth in the presence of osmostress;
however, its deletion in combination with mrc1 or mrc1^3A resulted in
osmosensitive cells (Fig. 4d). Therefore, cells in which transcription-replication
collisions occur in the absence of a cell-cycle delay upon osmostress depend on
the DNA-damage checkpoint to maintain viability.

Figure 3 | Hog1 delays replication by inhibiting fork progression and origin
firing. a-e, Wild-type (WT) and mrc1^3A cells were synchronized as in Fig. 2.
a, Dpb2 binding at the indicated regions was determined by chromatin
immunoprecipitation (ChIP). The graph shows the kinetics of Dpb2 after release
in YPD (Con) or osmostress (NaCl). b, Two-dimensional gel electrophoresis
analysis of replication-fork progression at early origin ARS305 or at adjacent
regions. Quantified data are indicated in top right corners as percentage of
replication forks (adjacent region) or as percentage of bubbles among total
replication forks (ARS305). ND, not determined. c, Co-immunoprecipitation
experiments show that Hog1 phosphorylation of Mrc1 upon osmostress alters
association with Pol2. IP, immunoprecipitate. d, DNA-combing analysis of
replication forks in wild-type and mrc1^3A cells in the absence or presence of
osmostress (30 min). Graph indicates the distribution of BrdU track length (kb).
*P < 0.05, **P < 0.005. e, Hog1 phosphorylation of Mrc1 delays Cdc45 binding at
the early origin ARS305. Data show the mean and s.d. of three independent
experiments.

Stress induces a rapid and transient response in gene expression to maximize
cell survival. However, when stress occurs during S phase, an increase of gene
expression poses the risk of collision between the replication and transcription
machineries, which can only be prevented if DNA replication is delayed while
transcription lasts. To coordinate transcription and replication, a single
signal transduction kinase, Hog1, is able to target both processes by inducing
adaptive transcription while delaying replication (Supplementary Fig. 1).

Figure 4 | Hog1-Mrc1 checkpoint prevents genomic instability upon osmostress.
a, mrc1^3A cells show an increased frequency of plasmid loss upon osmostress.
S-phase cells (wild type (WT) and mrc1^3A) were plated on YPD (Con) or YPD plus
NaCl (osmostress), and red colonies were quantified. b, Schematic of the IN/OUT
vectors used in the TAR assays. The graph shows TAR frequency in wild-type
(MRC1) or mrc1^3A cells in the absence (control) or presence of osmostress
(NaCl). c, Rad52 foci in response to NaCl. Wild-type and mrc1^3A cells carrying
RAD52-YFP were scored for foci without stress (Con) or after 4 h of osmostress
(NaCl). a-c, Data represent the mean and s.d. of four independent experiments.
d, mrc1^3A cells are osmosensitive when the Rad53-dependent checkpoint pathway
is impaired.


The known S-phase checkpoint is a surveillance mechanism that inhibits and
protects chromosome replication in response to internal signals that arise from
the replication process and DNA damage. We show that Hog1 and Mrc1 define an
alternative S-phase checkpoint to integrate extracellular stimuli into DNA
replication, ensuring proper coordination of transcription and replication upon
stress. This coordination prevents the genomic instability that would otherwise
be caused by collision of the two machineries. Remarkably, the two pathways
share Mrc1 to delay S-phase progression. Mrc1 has two sets of phosphorylation
sites, some targeted by Mec1 that lead to Rad53 activation and delay late origin
firing, and some targeted by Hog1 that alter Cdc45 association with early
origins and delay both early and late origin firing (Supplementary Fig. 15).
Therefore, not only is the input signal received by Mrc1 different for Mec1 and
Hog1, but the outcome of phosphorylation by the two kinases is also different,
thus defining completely independent pathways from top to bottom.

There are many extracellular stimuli that require the rapid induction 
of a vast number of genes to maximize cell survival. These changes in 
gene expression need to be compatible with replication. This study 
provides insights about how cells can deal with high transcription rates 
while replicating, which might reflect a common need in eukaryotic cells. 


METHODS SUMMARY 


Genetic methods, strains and plasmids and biochemical experiments are described
in Methods.

Full Methods and any associated references are available in the online version of
the paper.

METHODS

DNA plasmids and yeast strains. Plasmid pAD88 (MRC1) was obtained by cloning the
MRC1 open reading frame (ORF) and its promoter (323 bp upstream from the ORF) in
the BamHI site of the episomal vector pRS415. pAD66 (mrc1^3A) and pAD79
(mrc1^3D) were obtained by sequential site-directed mutagenesis of pAD88. In
these mutant constructs, Thr 169, Ser 215 and Ser 229 are mutated to Ala or Asp,
respectively. To obtain the integrative constructs pAD74 (MRC1), pAD75 (mrc1^3A)
and pAD103 (mrc1^3D), ORFs and their promoters were isolated from pAD88, pAD66
and pAD79, respectively, and cloned into the pRS406 integrative vector. pAD57
(MRC1) was obtained by cloning MRC1 in the BamHI site of the pGEX-6P-1 vector
in-frame with the GST N-terminal tag. pAD67 (mrc1^3A) was obtained by isolating
the mrc1^3A allele from pAD66 by BamHI digestion and cloning it into the BamHI
site of the pGEX-6P-1 vector in-frame with the GST N-terminal tag. STL1-IN and
STL1-OUT vectors are based on the GAL-IN and GAL-OUT vectors, respectively, and
were modified by swapping the GAL1 promoter with the Hog1-dependent STL1
promoter. STL1 expression is controlled by the Hot1 transcription factor. To
detect Rad52 foci, pWJ1344 carrying RAD52-YFP (ref. 27) was used. For the
two-hybrid assay, pAD99 was obtained by cloning the full-length MRC1 ORF in the
bait vector pBTM116. pACTII-Hog1 was also used in the two-hybrid assay (ref. 28).

The strains used in this work were derived from W303 (MATa, his3 leu2 trp1 ura3
ade2 can1) or its mutant cdc7-4 (MATa, his3 leu2 trp1 ura3 ade2 can1, cdc7-ts4),
except for the genomic instability assay, in which Yph277 (ref. 29) was used.
The cdc7-4 allele was characterized previously (ref. 30). The following strains
were used to follow cell cycle progression by FACS and western blot: YAD93 (MATa
his3 leu2 trp1 ura3 ade2 can1 cdc7-ts4 mrc1::KanMX, Clb5-TAP) transformed with
pRS415, pAD88 (pRS415-MRC1), pAD66 (pRS415-mrc1^3A) or pAD79 (pRS415-mrc1^3D),
YAD125 (cdc7-ts4 mrc1Δ, pAD74 (pRS406-MRC1)) and YAD126 (cdc7-ts4 mrc1Δ, pAD75
(pRS406-mrc1^3A)). In the ChIP experiments we used the following strains: YAD127
(cdc7-ts4 mrc1Δ, pAD74 (pRS406-MRC1) DPB2-HA), YAD128 (cdc7-ts4 mrc1Δ, pAD74
(pRS406-mrc1^3A) DPB2-HA), YAD192 (cdc7-ts4 mrc1Δ, pAD74 (pRS406-mrc1^3D)
DPB2-HA), YAD113 (cdc7-ts4 mrc1Δ, pAD74 (pRS406-MRC1) CDC45-HA), YAD179
(cdc7-ts4 mrc1Δ, pAD74 (pRS406-mrc1^3A) CDC45-HA), YAD190 (sml1Δ rad53Δ
CDC45-HA), YAD155 (cdc7-ts4 MRC1-HA), YAD184 (cdc7-ts4 mrc1Δ, pAD74
(pRS406-MRC1) POL2-HA), YAD185 (cdc7-ts4 mrc1Δ, pAD74 (pRS406-mrc1^3A) POL2-HA).
For immunoprecipitation assays, the strains YAD134 (HOG1-HA MRC1-TAP), YAD103
(MRC1-TAP), YAD122 (POL2-HA MRC1-MYC), YAD137 (POL2-HA) and YAD168 (POL2-HA
mrc1^3A-MYC) were used to assay the interaction between DNA Pol2 and Mrc1 or
Mrc1^3A. YAD144 (sml1Δ mec1Δ MRC1-TAP) and YAD105 (MRC1-TAP hog1) were used to
assay the interaction between Hog1 and Mrc1. All tags (HA, TAP and Myc) were
integrated at the C terminus of the gene (CLB5, DPB2, MRC1, HOG1, POL2, CDC45).
YAD125 and YAD126 were also used for two-dimensional gel electrophoresis of
replication forks and recombination assays to detect TAR. YAD192 (cdc7-ts4
mrc1Δ, pAD75 (pRS406-mrc1^3A) hot1Δ::KANMX) was also used in the TAR assays. In
the latter case, these strains were transformed with the STL1-IN and STL1-OUT
plasmids. For Rad52 foci experiments, the W303-1BR5 (ref. 31) and WMRS (MAT
ade2-1 can1-100 his3-11,15 leu2-3,112 trp1-1 ura3-1 mrc1Δ::KanMX, KanMX4::NAT
pAD75 (pRS406 mrc1^3A)::URA3 RAD5+) strains were used. Osmosensitivity assays
were performed with YAD143 (sml1Δ rad53Δ mrc1Δ) carrying the pRS415, pAD66 or
pAD88 plasmids. For DNA combing, strains were modified to allow incorporation of
exogenous BrdU into genomic DNA. Seven copies of the herpes simplex TK gene
under the control of the yeast GDP promoter were inserted at the URA3 locus. The
resulting strains used in the DNA-combing experiments were: WRBMb-6C (MATa
ade2-1 can1-100 his3-11,15 leu2-3,112 trp1-1 bar1Δ:: MRC1::TAP::KAN
RRM3::FLAG::KAN ura3::URA3/GDP-TK(7X)) and WRB3Ab-6C (MATa ade2-1 can1-100
his3-11,15 leu2-3,112 trp1-1 bar1Δ:: mrc1^3A::TAP::KAN RRM3::FLAG::KAN
ura3::URA3/GDP-TK(7X)).

Expression and purification of recombinant proteins. Escherichia coli bacteria
were grown at 37 °C to an OD600 nm of 0.5. Then GST-tagged proteins were induced
for 6 h by adding 1 mM isopropylthiogalactoside (IPTG) at 25 °C. After induction,
cells were collected by centrifugation and resuspended in 1/50 volume of STET 1X
buffer (100 mM NaCl, 10 mM Tris-HCl pH 8.0, 10 mM EDTA pH 8.0, 5% Triton X-100
supplemented with 2 mM dithiothreitol (DTT) and 1 mM phenylmethylsulphonyl
fluoride (PMSF), 1 mM benzamidine, 200 µg ml⁻¹ leupeptin and 200 µg ml⁻¹
pepstatin). Cells were lysed by sonication on ice and cleared by high-speed
centrifugation. GST-fused proteins were pulled down from supernatants with
300 µl of 4B glutathione-sepharose beads (GE Healthcare, 50% slurry equilibrated
with STET) by mixing for 90 min at 4 °C. The glutathione-sepharose beads were
collected by brief centrifugation and washed four times in STET 1X buffer and
two times in 50 mM Tris-HCl pH 8.0 buffer supplemented with 2 mM DTT. The
GST-fused proteins were then eluted in 200 µl of 50 mM Tris-HCl pH 8.0 buffer
supplemented with 2 mM DTT and 10 mM reduced glutathione (Sigma) by rotating for
20 min at 4 °C and stored at -80 °C.

Kinase assays. GST-MRC1 and GST-mrc1^3A were expressed in E. coli and purified.
Eluted proteins were mixed with 1 µg of purified GST-Hog1 (pre-activated with
GST-PBS2^EE) in 1X kinase buffer (50 mM Tris-HCl pH 7.5, 10 mM MgCl2, 2 mM DTT)
supplemented with 100 µM cold ATP and γ-32P-ATP (5M). The mix was incubated for
20 min at 37 °C and samples were resolved using SDS-PAGE. The proteins were
visualized by Coomassie staining, and phosphorylation was detected using X-OMAT
(Kodak) films.

Western blotting. TCA protein extracts were resolved using SDS-PAGE. Total
amounts of the indicated proteins were detected by immunoblotting using the
indicated antibodies with the ECL detection reagent (Amersham). For
quantification, the corresponding films were scanned at 16 bits per channel and
quantified using QuantityOne Application Software version 4.6.1.

Immunoprecipitation assays. The selected strains were grown to mid-log phase
(OD660 nm = 1) and stressed or not with 0.4 M NaCl for 7 min at 25 °C. Cells
were collected (400 ml per condition) and kept at -80 °C, and pellets were
resuspended in 2 ml of lysis buffer (45 mM HEPES-KOH pH 7.2, 150 mM NaCl, 1 mM
EDTA, 10% glycerol, 0.2% NP40) containing the whole cocktail of antiproteases
and phosphatase inhibitors. An equal volume of glass beads (of diameter 0.5 mm)
was added, and cells were broken by vortexing at 4 °C. The whole extract was
clarified by centrifuging for 10 min at 10,500g and an aliquot was taken for
further analysis. 3-7 mg of the extracts were incubated overnight with anti-Myc
monoclonal antibody (9E10) or with anti-HA affinity matrix (Roche), and the
beads were washed five times with the lysis buffer. Antibody-bound fractions and
the corresponding whole-cell extract were boiled in SDS-containing sample buffer
and loaded in 8% acrylamide gels.

Growth conditions and FACS analyses. Overnight cultures were diluted to an
OD660 nm of 0.3 and grown for 3 h at 25 °C in YPD. cdc7-4 cells were
synchronized at the onset of S phase in two steps. First, they were incubated
with α-factor for 2 h at 25 °C (40 µg ml⁻¹), washed and then released at 37 °C
(incubated for 2 h) to block the cell cycle at the beginning of S phase, before
DNA replication starts. Second, they were released from the temperature block,
and the cells were allowed to progress into S phase at 25 °C, in YPD,
supplemented or not with 0.4 M NaCl or 0.8 M sorbitol. For the GAL1::Pbs2^EE
overexpression experiments, cells were grown overnight in synthetic minimal
medium (SD) with 2% raffinose. The GAL1 promoter was induced with 2% galactose
after 90 min at 37 °C, and left for an extra 90 min before release from the
temperature block. For FACS, cells were fixed in ethanol, treated overnight with
RNase A at 37 °C in 50 mM sodium citrate, stained with propidium iodide and
analysed in a FACScan flow cytometer (Becton Dickinson). A total of 10,000 cells
were analysed and the population of G1 quantified for each time point using
WinMDI 2.9.

ChIP assays. In ChIP experiments, cells were treated as explained above, but
they were released into S phase at 16 °C to assess Dpb2-HA and Mrc1-HA binding
onto DNA sequences. Cells (50 ml per point) from cell-cycle time courses were
collected at OD660 nm = 0.7 and treated with 1% formaldehyde for 20 min at room
temperature. Glycine was added to a final concentration of 330 mM and the
incubation continued for 15 min. Cells were collected, washed four times with
cold TBS (20 mM Tris-HCl, pH 7.5, 150 mM NaCl), and kept at -20 °C for further
processing. Cell pellets were resuspended in 0.3 ml cold lysis buffer (50 mM
HEPES-KOH, pH 7.5, 150 mM NaCl, 1 mM EDTA, 0.1% sodium deoxycholate, 0.1% SDS,
1 mM PMSF) supplemented with 1% Triton X-100. An equal volume of glass beads (of
diameter 0.5 mm) was added, and cells were disrupted by vortexing (with a
Vortex-Genie) for 13 min on ice. The lysate was diluted into 0.6 ml lysis
buffer, and the glass beads were discarded. The crosslinked chromatin was
sonicated to yield an average DNA fragment size of 350 bp (range, 100-850 bp).
Finally, the sample was clarified by centrifugation at 15,500g for 5 min at
4 °C. Chromatin solution (600 µl) was incubated with 50 µl anti-HA monoclonal
antibody pre-coupled to anti-mouse IgG-conjugated paramagnetic beads (Dynabeads
M-450, Dynal). After 90 min at 4 °C on a rotator, beads were washed twice for
4 min in 1 ml lysis buffer, twice in 1 ml lysis buffer with 500 mM NaCl, twice
in 1 ml washing buffer (10 mM Tris-HCl pH 8.0, 0.25 M LiCl, 1 mM EDTA, 0.5%
NP-40, 0.5% sodium deoxycholate) and once in 1 ml TE (10 mM Tris-HCl pH 8.0,
1 mM EDTA). Immunoprecipitated material was eluted twice from the beads by
heating for 10 min at 65 °C in 50 µl elution buffer (25 mM Tris-HCl pH 7.5,
10 mM EDTA, 0.5% SDS). To reverse crosslinks, samples were adjusted to 0.3 ml
with elution buffer and incubated overnight at 65 °C. After extraction with
phenol-chloroform-isoamyl alcohol and chloroform, DNA was ethanol precipitated
for 4 h at -20 °C in the presence of 20 µg glycogen, and resuspended in 30 µl of
TE buffer. For real-time PCR, oligonucleotides for the ARS305 and ARS501 origin
sequences, and for sequences located at 8 kb or 17 kb downstream of the ARS305
origin were used. As an internal PCR control, oligonucleotides for telomeric
regions were used.

Two-dimensional gel electrophoresis. Cells growing exponentially in YPD at 25 °C
were synchronized in G1 phase with α-factor (3 µg ml⁻¹) for 2 h 30 min, washed
free of α-factor and immediately shifted to 37 °C for 2 h to synchronize cells
in S phase. HU was added at a final concentration of 40 mM 30 min before the
shift to 25 °C. The culture was treated with 0.4 M NaCl and shifted back to
25 °C. Total DNA from 50 ml of mid-log-phase cells was isolated as previously
described. DNA was restricted with EcoRV and/or HindIII. Two-dimensional gel
electrophoresis was performed as previously described. For hybridization, the
coordinates of the α-32P PCR probes used were 32170-35159 to detect the
HindIII-HindIII fragment and 37883-41883 for the EcoRV-HindIII fragment of
ARS305 on chromosome III. Signals were quantified using a Fujifilm FLA-5100
Phosphorimager and the ImageGauge program. The ratio of bubbles (percentage) was
normalized with respect to the total amount of replicating molecules to
determine the relative delay of replication fork initiation. Quantification of
the replicative intermediates in the fragment adjacent to ARS305 was normalized
with respect to the total amount of signals present in the gel, including linear
monomers (7).
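
The two normalizations described above can be illustrated with a minimal Python
sketch. It is not the authors' analysis code: the function names and the example
signal values are hypothetical, and the only assumption is that each quantity is
reported as a percentage of the stated reference signal.

```python
def percent_of(signal, reference):
    """Express a quantified signal as a percentage of a reference signal."""
    if reference <= 0:
        raise ValueError("reference signal must be positive")
    return 100.0 * signal / reference


def bubble_percentage(bubble_signal, replicating_signal):
    """Bubble-arc signal relative to all replicating molecules (origin fragment)."""
    return percent_of(bubble_signal, replicating_signal)


def intermediates_percentage(intermediate_signal, total_gel_signal):
    """Replication intermediates relative to total gel signal, linear monomers included."""
    return percent_of(intermediate_signal, total_gel_signal)


# Hypothetical phosphorimager readings in arbitrary units (not data from the paper).
print(bubble_percentage(30.0, 120.0))         # 25.0
print(intermediates_percentage(45.0, 900.0))  # 5.0
```

Keeping the two references separate matters here: the bubble ratio is scaled to
replicating molecules only, whereas the intermediates in the adjacent fragment
are scaled to the whole lane signal.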

Rad52 foci. Spontaneous and NaCl-induced Rad52-YFP foci were counted in
4',6-diamidino-2-phenylindole (DAPI)-stained nuclei from S-G2 mid-log cells
bearing the plasmid pWJ1344. For NaCl treatment, cells were incubated in medium
supplemented with NaCl at a final concentration of 0.8 M for an additional 4 h
before the quantification of the Rad52-YFP foci, which were visualized with a
Leica DC 350F microscope.

Genomic instability assay. Overnight cultures of the Yph277 strain transformed
with plasmids harbouring the wild type or the mrc1^3A mutant were diluted to an
OD660 nm of 0.3, grown in YPD until an OD660 nm of 0.5, and synchronized with
α-factor for 90 min at 30 °C (40 µg ml⁻¹). Cells were washed and released into S
phase in YPD for 30 min and 200 µl of cells (1/1,000 diluted) of each culture
were plated on YPD or YPD with 0.8 M NaCl, incubated at 30 °C for 3 days and
left at 4 °C.

Recombination assays. cdc7-4 cells (wild type and mrc1^3A) harbouring STL1-IN or
STL1-OUT vectors (which contain the origin of replication ARS209 or ARSH4 as
named in the scheme) were synchronized as described above and 200 µl of S-phase
cells (1/100 diluted) were plated on SD Trp⁻ plates (control plates) or on SD
Trp⁻ Leu⁻ plates (recombination plates), supplemented or not with 0.8 M NaCl.
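
The text does not spell out how a recombination frequency is derived from the
two plate types. A common convention, assumed here and not confirmed by the
source, is to divide the number of Leu⁺ recombinants by the number of viable
plasmid-bearing cells, correcting each colony count for its dilution. The
function name, parameters and colony counts below are hypothetical.

```python
def recombination_frequency(leu_plus_colonies, control_colonies,
                            leu_plate_dilution_fold=100,
                            control_dilution_fold=100):
    """Recombinants per viable cell, from colony counts on the two plate types.

    Assumption: both platings use the same volume, so only the dilution
    folds (100 for a 1/100 dilution) are needed to compare the counts.
    """
    if control_colonies <= 0:
        raise ValueError("control plate must have at least one colony")
    recombinants = leu_plus_colonies * leu_plate_dilution_fold
    viable_cells = control_colonies * control_dilution_fold
    return recombinants / viable_cells


# Hypothetical counts: 12 colonies on SD Trp- Leu-, 300 on SD Trp- (same dilution).
print(recombination_frequency(12, 300))  # 0.04
```

With equal dilutions the correction cancels and the frequency reduces to the
ratio of colony counts; the dilution arguments only matter if the two plate
types are plated at different dilutions.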

Two-hybrid assay and β-galactosidase assay. The two-hybrid analysis was carried
out by using pACTII as the activation domain plasmid and pBTM116-LexA as the
DNA-binding domain plasmid. The plasmid LexA-Mrc1 was cotransformed with
pACT-Hog1 using the L40 (MATa trp1 leu2 his3 LYS::lexA-HIS3 URA3::lexA-LacZ)
reporter strain. We performed the corresponding negative controls (using the
empty vectors) and positive controls (the well-known interactors LexA-Raf and
pACT-Ras) in parallel. Positive clones were selected and further tested for
β-galactosidase activity as follows. The transformed yeast strains were grown
selectively until mid-log phase in the appropriate selective liquid media and
then diluted in YPD for 3 h. Logarithmically growing cells
(OD660 nm = 0.5-0.8) were treated with 0.4 M NaCl for 35 min and permeabilized
by ethanol-toluene treatment, and β-galactosidase activity was determined as
light intensity at OD415 nm after the addition of
o-nitrophenyl-β-D-galactoside (ONPG) substrate.

DNA combing. Mid-log phase cells were synchronized with α-factor and incubated
for 20 min in 40 mM HU before adding 200 µg ml⁻¹ BrdU (Sigma) and 0.4 M NaCl.
DNA combing was performed as described (ref. 32). DNA fibres were extracted in
agarose plugs after 30 and 45 min of BrdU labelling, corresponding to 50 and
65 min after release, and were stretched on silanized coverslips. DNA molecules
were counterstained with an anti-ssDNA antibody (MAB3868, Chemicon; 1/100) and a
goat anti-mouse antibody coupled to Alexa 546 (A11030, Molecular Probes, 1/50).
BrdU was detected with BU1/75 (AbCys, 1/20) anti-BrdU antibody. DNA fibres were
analysed in a Leica DM6000 microscope equipped with a DFC390 camera (Leica).
Data acquisition was performed with LAS AF (Leica). Velocity was estimated as
the difference between the median values of the track length of the 45 and
30 min time points divided by 15. In asynchronous cultures the combing assay was
performed by incubating mid-log phase cells for 45 min in medium with
200 µg ml⁻¹ BrdU with or without 0.4 M NaCl.
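
The velocity estimate stated above (difference between the median track lengths
at 45 and 30 min, divided by the 15 min interval) can be written as a short
Python sketch. The function name and the track lengths are hypothetical and
serve only to illustrate the arithmetic; they are not measurements from this
study.

```python
from statistics import median


def fork_velocity(tracks_30min_kb, tracks_45min_kb, interval_min=15.0):
    """Replication speed in kb per min from BrdU track lengths at two time points.

    Velocity = (median length at 45 min - median length at 30 min) / interval.
    """
    if not tracks_30min_kb or not tracks_45min_kb:
        raise ValueError("each time point needs at least one track length")
    return (median(tracks_45min_kb) - median(tracks_30min_kb)) / interval_min


# Hypothetical BrdU track lengths in kb (illustrative values only).
tracks_30 = [8.0, 10.0, 12.0]    # median 10.0 kb
tracks_45 = [12.5, 14.5, 16.5]   # median 14.5 kb
print(fork_velocity(tracks_30, tracks_45))  # 0.3
```

Using medians rather than means makes the estimate less sensitive to a few very
long or very short tracks, which are typical of combing data.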


27. Lisby, M., Mortensen, U. H. & Rothstein, R. Colocalization of multiple DNA
double-strand breaks at a single Rad52 repair centre. Nature Cell Biol. 5,
572-577 (2003).

28. Nadal, E., Casadome, L. & Posas, F. Targeting the MEF2-like transcription
factor Smp1 by the stress-activated Hog1 mitogen-activated protein kinase. Mol.
Cell. Biol. 23, 229-237 (2003).

29. Spencer, F. et al. Mitotic chromosome transmission fidelity mutants in
Saccharomyces cerevisiae. Genetics 124, 237-249 (1990).

30. Bousset, K. & Diffley, J. F. The Cdc7 protein kinase is required for origin firing 
during S phase. Genes Dev. 12, 480-490 (1998). 

31. Moriel-Carretero, M. & Aguilera, A. A postincision-deficient TFIIH causes 
replication fork breakage and uncovers alternative Rad51- or Pol32-mediated 
restart mechanisms. Mol. Cell 37, 690-701 (2010). 

32. Bianco, J. N. et al. Analysis of DNA replication profiles in budding yeast and 
mammalian cells using DNA combing. Methods 57, 149-157 (2012). 


©2013 Macmillan Publishers Limited. All rights reserved 




doi:10.1038/nature11658 


DNA-repair scaffolds dampen checkpoint signalling 
by counteracting the adaptor Rad9 


Patrice Y. Ohouo¹, Francisco M. Bastos de Oliveira¹, Yi Liu¹, Chu Jian Ma¹ & Marcus B. Smolka¹


In response to genotoxic stress, a transient arrest in cell-cycle pro- 
gression enforced by the DNA-damage checkpoint (DDC) signal- 
ling pathway positively contributes to genome maintenance’. 
Because hyperactivated DDC signalling can lead to a persistent 
and detrimental cell-cycle arrest’, cells must tightly regulate the 
activity of the kinases involved in this pathway. Despite their 
importance, the mechanisms for monitoring and modulating 
DDC signalling are not fully understood. Here we show that the 
DNA-repair scaffolding proteins Slx4 and Rtt107 prevent the aber- 
rant hyperactivation of DDC signalling by lesions that are generated 
during DNA replication in Saccharomyces cerevisiae. On replica- 
tion stress, cells lacking Slx4 or Rtt107 show hyperactivation of 
the downstream DDC kinase Rad53, whereas activation of the 
upstream DDC kinase Mec1 remains normal. An Slx4-Rtt107 complex
counteracts the checkpoint adaptor Rad9 by physically interacting
with Dpb11 and phosphorylated histone H2A, two positive
regulators of Rad9-dependent Rad53 activation. A decrease in DDC 
signalling results from hypomorphic mutations in RAD53 and H2A 
and rescues the hypersensitivity to replication stress of cells lacking 
Slx4 or Rtt107. We propose that the Slx4-Rtt107 complex modu- 
lates Rad53 activation by a competition-based mechanism that 
balances the engagement of Rad9 at replication-induced lesions. 
Our findings show that DDC signalling is monitored and modu- 
lated through the direct action of DNA-repair factors. 

Slx4 is an evolutionarily conserved DNA-repair scaffolding protein 
that is important for the cellular response to exogenous DNA- 
damaging agents*’, and mutations in human SLX4 were recently 
linked to Fanconi anaemia*”. In S. cerevisiae (budding yeast), cells that 
lack SLX4 (slx4Δ cells) are highly sensitive to methyl methanesulphonate
(MMS)’, a DNA-alkylating agent that blocks replication and
induces the DDC pathway. While investigating the activation status
of the S. cerevisiae DDC kinase Rad53 in slx4Δ cells, we noted that MMS
treatment leads to hyperphosphorylation, and thus hyperactivation, of
Rad53 compared with that in wild-type cells (Fig. 1a and Supplementary
Fig. 1a), which is consistent with a previous report'®. However, the
phosphorylation of histone H2A (also known as Hta1 and Hta2), a
substrate of the upstream DDC kinase Mec1, at serine 129 (here referred
to as H2ApS129) was not increased in slx4Δ cells (Fig. 1a, lower panel).
These results suggest that the hyperactivation of Rad53 in slx4Δ cells is
not caused by increased damage-induced Mec1 signalling but by
improper downstream regulation of Rad53 activation. To test this possibility,
we compared the phosphoproteome of wild-type and slx4Δ cells
after MMS treatment, using quantitative mass spectrometry. Although
most of the detected Mec1 targets were phosphorylated to the same
extent in both cell types, Rad53-dependent phosphorylation was significantly
higher in slx4Δ cells than in wild-type cells (Supplementary
Fig. 1b), further supporting the idea that Slx4 has a role in specifically 
blocking Rad53 hyperactivation. Because the activation of Rad53 in 
response to MMS mostly depends on the checkpoint adaptor Rad9 
(Fig. 1b and Supplementary Fig. 2), Slx4 probably counteracts Rad9- 
dependent Rad53 activation. 


To test whether the sensitivity of slx4Δ cells to MMS is caused
mostly by aberrant Rad53 hyperactivation, we used hypomorphic
alleles of rad53 that result in lower Rad53 activation, reasoning that
these alleles would rescue the MMS sensitivity of slx4Δ cells. Rad53 has
two FHA domains, which bind to phosphorylated Rad9 in a redundant
manner and mediate Rad53 activation'' (Fig. 1c). Mutations in the
FHA2 domain promote a stronger reduction in MMS-induced Rad53
activation than do mutations in the FHA1 domain’’. Whereas a mutation
(R70A, in which arginine is substituted with alanine at amino acid
70) in the FHA1 domain of Rad53 had no effect on the MMS sensitivity
of slx4Δ cells, a mutation (R605A) in the FHA2 domain reduced the
MMS sensitivity of slx4Δ cells (Fig. 1d). Consistent with our hypothesis
that Rad53 hyperactivation is the cause of the MMS sensitivity of slx4Δ
cells, mutation of the FHA2 domain resulted in a decrease in Rad53
activation in slx4Δ cells to a level similar to that of wild-type cells
(Fig. 1e). Collectively, these results suggest that Slx4 has a crucial role
in preventing excessive Rad9-dependent activation of Rad53 (Fig. 1f). 
The levels of MMS used here require that cells pass through the S phase 
of the cell cycle for Rad53 to become active’; therefore, our results 
suggest that Slx4 counteracts the Rad9-Rad53 pathway in response 
to replication-induced lesions. The finding that combined deletion 
of the SLX1 and RAD1 genes, which encode nucleases that are known 
to associate with Slx4, leads to lower MMS sensitivity and Rad53 
activation than in slx4Δ cells (Supplementary Fig. 4) supports a nuclease-independent
function for Slx4 during the cellular response to MMS-induced
replication stress.

We recently reported that on replication stress, Slx4 binds to Dpb11 
(ref. 14), a replication factor that is involved in DDC activation’>”®. 
Because Dpb11 binds to Rad9 and positively regulates Rad9-dependent 
Rad53 activation'”"*, we assessed whether the Slx4-Dpb11 interaction 
has a role in counteracting Rad53 activation during MMS treatment. 
We previously showed that phosphorylation of Slx4 by Mec1 mediates
the interaction of Slx4 with Dpb11 and that an Slx4 mutant lacking
seven Mec1 consensus phosphorylation sites (Slx4-7MUT) cannot
stably interact with Dpb11 (ref. 14). Rad53 is hyperactivated in slx4- 
7MUT cells (Fig. 2a), supporting a model in which the Slx4-Dpb11 
interaction is important for preventing Rad53 hyperactivation. 

Next, we tested whether the Slx4-Dpb11 interaction inhibits the
ability of Rad9 to bind to Dpb11 in wild-type and slx4Δ cells. Deletion
of SLX4 leads to a significant increase in the MMS-induced interaction
between Dpb11 and Rad9 (Fig. 2b and Supplementary Fig. 5),
suggesting that Slx4 and Rad9 compete for Dpb11 binding. Dpb11
contains two pairs of BRCT domains, which bind to phosphorylated
motifs. We found that recombinant BRCT domains 1 and 2 (BRCT1/2)
of Dpb11 can bind phosphorylated Slx4 and phosphorylated Rad9 from
MMS-treated S. cerevisiae lysates (Fig. 2c). This finding is consistent
with a model in which Slx4 and Rad9 compete for BRCT1/2 binding.

Dpb11 BRCT1/2 was previously shown to interact directly with
cyclin-dependent kinase (CDK)-dependent phosphorylation sites in 
Sld3, thereby initiating DNA replication’. To better understand 
the mechanism of the Slx4-Dpb11 interaction, we searched for a 
Figure 1 | Slx4 counteracts Rad9-dependent Rad53 activation. a, Western blot
showing phosphorylation of haemagglutinin (HA)-tagged Rad53 and histone
H2ApS129 after MMS treatment. b, Western blot showing the phosphorylation
status of Rad53-HA in the indicated strains after MMS treatment. c, Schematic 
representation of the Rad53 protein and its interaction with Rad9. d, MMS 

sensitivity assay of strains containing the indicated Flag-tagged mutant Rad53 


CDK-targeting motif in Slx4 that resembles the Dpb11-binding region 
in Sld3. Notably, Slx4 contains proline-directed phosphorylation sites 
that align well with serine 600 (S600) and S622 of Sld3, which are 
important for the Dpb11-Sld3 interaction’””® (Supplementary Fig. 6). 
We tested the importance of the proline-directed sites within the
Sld3-like region of Slx4 and found that S486 of Slx4 is crucial for the
MMS-induced interaction with Dpb11 (Fig. 2d). In addition, three
canonical Mec1 phosphorylation sites (S/T-Q sites) that are important
for mediating a robust Slx4-Dpb11 interaction” are located in or near
the Sld3-like Dpb11-binding region in Slx4. We conclude that the
Slx4-Dpb11 interaction is mediated by the coordinated action of both
Mec1 and a proline-directed kinase at a motif that is probably targeted
by BRCT1/2 of Dpb11. Although we detected residual binding between
Dpb11 and Slx4-7MUT, we did not detect an interaction between
Dpb11 and Slx4-S486A, suggesting that the phosphorylation of S486
has a more important role in mediating the Slx4-Dpb11 interaction
than the Mec1 consensus phosphorylation sites. This is also supported
by the finding that slx4-S486A leads to Rad53 hyperactivation (Fig. 2e)
and to a higher MMS sensitivity than does slx4-7MUT (Fig. 2f).
Interestingly, previous reports showed that proline-directed sites in 
Rad9 are also important for the Dpb11-Rad9 interaction’”'* (Sup- 
plementary Fig. 7). 
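The two motif classes discussed above, Mec1 consensus S/T-Q sites and proline-directed S/T-P sites, are simple enough to locate by pattern matching. The following Python sketch is illustrative only: the `find_sites` helper and the toy sequence are hypothetical, the sequence is not that of Slx4, Sld3 or Rad9, and this is not the alignment procedure behind Supplementary Fig. 6.

```python
import re

# Serine or threonine followed by glutamine (Mec1 consensus, S/T-Q).
SQ_TQ = re.compile(r"[ST]Q")
# Serine or threonine followed by proline (proline-directed, S/T-P).
SP_TP = re.compile(r"[ST]P")


def find_sites(sequence, pattern):
    """Return 1-based positions of the phospho-acceptor residue of each match."""
    return [match.start() + 1 for match in pattern.finditer(sequence)]


# Hypothetical toy peptide, not a real protein sequence.
toy_sequence = "MASQLTPKSPAATQ"
print("S/T-Q sites:", find_sites(toy_sequence, SQ_TQ))
print("S/T-P sites:", find_sites(toy_sequence, SP_TP))
```

A scan of this kind only lists candidate sites. Whether a given site matters for Dpb11 binding still has to be tested by mutation, as was done for S486.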

To further test the model that Slx4 antagonizes Rad53 activation by 
binding to Dpb11 and outcompeting Rad9, we overexpressed Slx4 
using an ADH1 or a TDH3 promoter and monitored different steps 
of checkpoint activation. In an early step in checkpoint activation, 
Rad9 assembles into a ternary complex with Dpb11 and Mec1 and is
hyperphosphorylated by Mec1 (ref. 18). We first monitored the phosphorylation
status of Rad9 in cells expressing Slx4 from its endogenous


proteins. Similar results were obtained with strains containing untagged Rad53 
(Supplementary Fig. 3). e, Western blot showing the phosphorylation status of 
Rad53-Flag in the indicated strains after MMS treatment. f, Model for the role 
of Slx4 in uncoupling Rad53 activation from Mec1 signalling, through
counteracting Rad9. Rad53-p, phosphorylated Rad53; WT, wild type. 


promoter and in cells overexpressing Slx4. Overexpression of wild-type
Slx4 but not S486A-mutated Slx4 significantly inhibited the MMS-induced
hyperphosphorylation of Rad9 (Fig. 2g), as shown by the
strong decrease in the slower migrating band at 45 and 60 min after
release from α-factor arrest. We then monitored the effect of Slx4
overexpression on Rad53 activation in response to MMS treatment 
(Supplementary Fig. 8a). Although overexpression of Slx4 leads to a 
small but consistent decrease in Rad53 activation early in the response, 
we did not observe the same effect at later time points. We speculated 
that after 40 min, the Rad53 activation that was observed in Slx4- 
overexpressing cells was being mediated by a parallel mechanism 
based on Dot1-mediated histone H3K79 methylation, which can pro- 
mote the recruitment of Rad9 to lesion sites independently of Dpb11 
(ref. 21). We therefore monitored Rad53 activation in cells lacking 
DOT1 and found that Slx4 overexpression significantly reduces
Rad53 activation also at later time points (Fig. 2h); this effect depends 
on S486 of Slx4 (Supplementary Fig. 8b, c). Next, we determined 
whether Slx4 overexpression specifically disrupts the Rad9-Dpb11 
interaction. We detected hyperphosphorylated Rad9 in a Dpb11 pull- 
down from cells expressing Slx4 from its endogenous promoter, but we 
did not detect Rad9 in a Dpb11 pull-down using cells that overexpressed
Slx4 (Fig. 2i). This effect was specific for S486 of Slx4. These findings 
support our model that Slx4 counteracts DDC signalling by binding to 
Dpb11 and preventing its stable interaction with Rad9 (Fig. 2j).

Slx4 forms a tight complex with the BRCT-domain-containing protein
Rtt107, which is a DNA-repair scaffold that stabilizes the Slx4-Dpb11
interaction. Analysis of the phosphorylation status of Rad53
in rtt107Δ cells showed that DDC signalling is hyperactivated (Fig. 3a),
suggesting that Rtt107 has a similar function to Slx4 in counteracting 
Figure 2 | Slx4 binding to Dpb11 counteracts the Dpb11-Rad9 interaction
and Rad53 activation. a, Western blot showing the phosphorylation status of 
Rad53-HA in the indicated strains after MMS treatment. b, Co- 
immunoprecipitation (co-IP) of Dpb11 and Rad9 (see also Supplementary Fig. 
5); input levels and phosphorylation status of Rad9 are shown. c, Pull-down of 
Slx4-Flag or Rad9-HA from S. cerevisiae lysates using recombinant BRCT1/2
and BRCT3/4 from Dpb11; ponceau staining of the membrane is also shown.
d, Interaction of the indicated Flag-tagged Slx4 mutants with Dpb11-HA; input
levels and phosphorylation status of Slx4 are shown. e, Rad53 phosphorylation
status in the indicated slx4 mutants. The experiment was performed as


Rad53 activation. Rtt107 recognizes H2ApS129 through its carboxy-terminal
pair of BRCT domains (BRCT5/6), and this region is essential
for the role of Rtt107 in the MMS response** (Supplementary Figs 9
and 10). Furthermore, the S129A mutation (h2a-S129A) abrogated
Slx4 phosphorylation on MMS treatment (Fig. 3b), suggesting that
the recruitment of Rtt107 to H2ApS129 occurs upstream of Slx4-Dpb11
complex formation. Because H2ApS129 can function as a positive
regulator of Rad9-dependent Rad53 activation’, we reasoned that the
hyperactivation of Rad53 in rtt107Δ cells could be suppressed by h2a-S129A.
rtt107Δ cells expressing the H2A(S129A) mutant had lower
Rad53 activation and MMS sensitivity than did rtt107Δ cells expressing
wild-type H2A (Fig. 3c, d). Disruption of the Rad9-H2A interaction by
mutation of the C-terminal pair of BRCT domains of Rad9 (K1088M)
partially rescued the MMS sensitivity of rtt107Δ cells (Supplementary
Fig. 11a). A similar rescue was also observed when overexpressing
described in Fig. 1a. f, MMS sensitivity assay on the indicated slx4 mutants.
g, Rad9 phosphorylation status in cells expressing WT or mutant Slx4 from the
endogenous SLX4 promoter or the ADH1 promoter. h, Rad53 phosphorylation
status in dot1Δ cells expressing Slx4 from the endogenous SLX4 promoter or
the TDH3 promoter. Cells were arrested in α-factor and released for 45 min in
medium containing MMS. i, Co-IP of Dpb11 and Rad9. j, Model for the
mechanism by which Slx4 counteracts Rad53 activation. G1, G1 cell-cycle 
phase; Rad9-p, hypophosphorylated Rad9; Rad9-ppp, hyperphosphorylated 
Rad9; WCE, whole cell extract. 


BRCT5/6 of Rtt107 (Supplementary Fig. 11b). Taken together, these
results show that, like Slx4, Rtt107 also counteracts DDC signalling.
The anti-checkpoint function of Rtt107 depends on its recognition of
H2ApS129, a step that is required subsequently for the assembly of
the Slx4-Dpb11 complex. Thus, the Slx4-Rtt107 complex functions
as a negative regulator of activation of the kinase Rad53 by physically
interacting with two positive regulators of the adaptor Rad9: Dpb11 and
H2ApS129.

To further test the anti-checkpoint function of the Slx4-Rtt107
complex, we developed an alternative experimental set-up in which
the presence of Slx4 would sensitize cells to replication stress by
decreasing DDC signalling below the level required for a proper cellular
response. For this set-up, we used cells lacking Mrc1, which is a
checkpoint adaptor that works in parallel with Rad9. Mrc1 mediates
Rad53 activation at stalled replication forks and therefore has a more 
Figure 3 | Rtt107 counteracts Rad9-dependent Rad53 activation by binding
to phosphorylated histone H2A. a, Western blot showing the
phosphorylation status of Rad53-HA in the indicated strains after MMS
treatment. b, Western blot showing Slx4-Flag phosphorylation status.
c, Western blot showing phosphorylation status of Rad53 and H2ApS129.
Indicated strains expressing HA-tagged Rad53 were arrested in nocodazole and 
released for 60 min in medium containing MMS. d, MMS sensitivity assay on 
the indicated mutants. 


central role than Rad9 in Rad53 activation in response to hydroxyurea*’.
On hydroxyurea treatment, mrc1Δ cells rely solely on Rad9
to activate Rad53. The interaction of Slx4 with Dpb11 was enhanced in
mrc1Δ cells (Fig. 4a), especially in response to hydroxyurea, suggesting
that the Slx4-Rtt107 complex actively counteracts Rad9 in mrc1Δ cells
(Fig. 4b). Consistent with our finding that the Slx4-Dpb11 interaction
requires H2ApS129, high levels of H2ApS129 accumulated close to the
origins of replication in mrc1Δ cells after hydroxyurea treatment
(Fig. 4c). Because we expect that the kinase activity of Rad53 is limiting
in mrc1Δ cells, we reasoned that deletion of SLX4 would be beneficial
in hydroxyurea-treated mrc1Δ cells, allowing more activation of


Figure 4 | Slx4 sensitizes mrc1Δ cells to hydroxyurea-induced replication
stress. a, Co-IP of Dpb11 and Slx4 after 2 h treatment with 0.1 M hydroxyurea
(HU). b, Model for the action of Slx4 and Rtt107 in mrc1Δ cells. c, HU
treatment of mrc1Δ cells leads to H2ApS129 accumulation near an origin of
replication. Chromatin immunoprecipitation (ChIP) analysis of H2ApS129 at
neighbouring ARS305 regions I, II and III in the indicated strains after exposure
to 0.2 M HU. Data are presented as mean ± s.e.m. (n = 3). *, significant
difference (P = 0.025), as analysed by unpaired, two-tailed Student's t-test.

d, HU sensitivity assay on the indicated mutants. e, Western blot showing 
the phosphorylation status of Rad53-HA after treatment with 10 mM HU. 
kb, kilobases. 
Rad53 through Rad9. Consistent with this idea, mrc1Δ slx4Δ cells
had a significantly higher hydroxyurea resistance than mrc1Δ cells
(Fig. 4d), and this resistance correlated with higher Rad53 activation
(Fig. 4e). Taken together, these results show that Slx4 sensitizes mrc1Δ
cells to replication stress, and they provide strong additional evidence
that the Slx4-Rtt107 complex counteracts Rad9 in response to replication-induced
lesions (see detailed model in Supplementary Fig. 12).
Because this function of the Slx4-Rtt107 complex depends on Mec1-dependent
phosphorylation, we propose that Slx4-Rtt107 is involved
in a mechanism that we have named DAMP — dampens checkpoint 
adaptor-mediated phospho-signalling — by which DDC signalling 
self-monitors its activation state. We speculate that by uncoupling 
upstream Mec1 signalling from downstream Rad53 activation, DAMP
could allow Mec1 to maintain control over specific effectors, such as
repair enzymes, without an aberrant arrest in cell-cycle progression. 


METHODS SUMMARY 


The yeast strains and plasmids used are described in Supplementary Tables 3 and 
4. The mass-spectrometry-based phosphorylation analyses of whole cell lysates or 
purified Rad53, as well as the other procedures used for cell growth and synchron- 
ization, genotoxin treatment and chromatin immunoprecipitation analysis, are 
detailed in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Yeast strains and plasmids. Strains generated in this study were derived either 
from MBS164 or MBS191 (both congenic to S288C) or W303 (where indicated). 
Unless indicated, all tags were inserted at the C terminus of the corresponding 
genes by homologous recombination at the genomic loci and were verified by 
western blotting. Tagged strains were assayed for sensitivity to MMS to ensure they 
behaved similarly to the wild-type strain. Sensitivity assays were independently 
confirmed in strains derived from freshly sporulated diploids. Standard cloning 
methods were used to generate the plasmids for this study. Plasmids containing 
domains of Dpb11 or Rtt107 tagged at the amino terminus with a PATH tag
(2X protein A + TEV cleavage site + 6X His)’° were based on the pET21a vector 
(Novagen). HA-tagged full-length Rtt107, together with its native promoter, was 
cloned into pRS416 (Stratagene) to generate pMBS163. Wild-type alleles cloned 
into the pFA6a vector (Addgene) were linearized before integration into the 
respective endogenous loci. A pYES2/NT C vector (Invitrogen) containing full-length
or C-terminal BRCTs of Rtt107 was used for overexpression. SLX4 and
DPB11 constructs containing an ADH1 or a TDH3 promoter were generated by 
fusing the respective promoters (800 base pairs upstream of the start codon) to the 
corresponding open reading frame. The resultant PCR products were subsequently 
cloned into the pRS416 or pFA6a vector. All point mutations were generated by 
site-directed mutagenesis using either the QuikChange Multi Site-Directed 
Mutagenesis Kit (Stratagene) or the PFU Ultra II kit (Agilent). All yeast strains 
and plasmids used in this study are described in Supplementary Tables 3 and 4 and 
are available on request. 

Cell synchronization and genotoxin treatment. Yeast cells were grown in yeast 
peptone dextrose (YPD) or drop-out medium at 30 °C. Log phase cultures (optical 
density at 600 nm ~ 0.3) were subjected to α-factor (0.5 µg ml⁻¹) or nocodazole
(1.5 µg ml⁻¹) treatment for G1 or G2/M arrest, respectively. Cells were then
washed and resuspended in warm medium containing the indicated genotoxin. 
Western blotting and immunoprecipitation. For western blotting, about 50 mg 
frozen cell pellet was lysed by bead beating at 4 °C in lysis buffer (50 mM Tris-HCl, 
pH 7.5, 0.2% Tergitol, 150 mM NaCl, 5 mM EDTA, 1 mM phenylmethylsulphonyl
fluoride (PMSF), Complete, EDTA-free Protease Inhibitor Cocktail (Roche) and 
PhosSTOP (Roche)). SDS loading buffer with 60 mM dithiothreitol (DTT) was 
added. Samples were separated by standard SDS-polyacrylamide gel electrophoresis
(SDS-PAGE). Proteins were detected using the following antibodies: anti-Rad53
(yc-19, 1:10,000, Santa Cruz Biotechnology), anti-H2ApS129 (07-0745,
1:10,000, Millipore), anti-HA (12CA5, 1:10,000, Roche) and anti-Flag (M2, 
1:5,000, Sigma) antibodies. For immunoprecipitation (IP), approximately 
100 mg frozen cell pellet was lysed by bead beating at 4 °C in lysis buffer
(50 mM Tris-HCl, pH 7.5, 0.2% Tergitol, 150 mM NaCl, 5 mM EDTA, 1 mM
PMSF, Complete, EDTA-free Protease Inhibitor Cocktail, 5 mM sodium fluoride
and 10 mM β-glycerophosphate). After adjusting protein concentrations to about
6 mg ml⁻¹, inputs were aliquoted, and lysates were incubated with either anti-HA
or anti-Flag agarose resin (Sigma) for 2-3 h at 4 °C. After three washes in lysis
buffer, bound proteins were eluted with three resin volumes of SDS elution buffer 
(100 mM Tris-HCl, pH 8.0, and 1% SDS) for HA IP or of Flag peptide (Sigma) 
solution (0.5 µg ml⁻¹ in 100 mM Tris and 0.2% Tergitol) for Flag IP. SDS loading
buffer with DTT was added, and samples were analysed by western blotting with 
the indicated antibodies. 

Pull-down with recombinant BRCT domain. Protein domains (for Dpb11 
BRCT1/2, amino acids 1-270; and BRCT3/4, amino acids 271-582) containing
an N-terminal PATH tag (see Yeast strains and plasmids) were expressed in 
Escherichia coli, bound to human IgG-agarose resin (GE Healthcare) and then 
used as bait for pull-downs from yeast lysates as previously described”. 

SILAC labelling of yeast. For mass spectrometry experiments, cells were grown in 
(—)Arg (—)Lys drop-out medium (‘light’ version complemented with normal 
arginine and lysine; ‘heavy’ version complemented with lysine ¹³C₆, ¹⁵N₂ and
arginine ¹³C₆, ¹⁵N₄) for at least five generations.

Purification of phosphopeptides by immobilized metal-ion affinity chromatography
(IMAC). For the purification of Rad53 phosphopeptides, approximately
0.6 g cell pellet from wild-type (grown in heavy medium) or slx4Δ (grown
in light medium) strains carrying Rad53-HA was lysed by bead beating at 4 °C in 
4 ml lysis buffer (50mM Tris-HCl, pH 7.5, 0.2% Tergitol, 150 mM NaCl, 5mM 
EDTA, Complete, EDTA-free Protease Inhibitor Cocktail, 5 mM sodium fluoride 
and 10 mM β-glycerophosphate). Lysates were incubated with anti-HA-agarose
resin (Sigma) for 4h at 4°C. After three washes with lysis buffer, bound proteins 
were eluted with three resin volumes of elution buffer (100 mM Tris-HCl, pH 8.0, 
and 1% SDS). Eluted proteins from light or heavy medium were mixed together, 
reduced, alkylated and precipitated. Proteins were resuspended in a solution of 
2 M urea and 12.5 mM Tris-HCl, pH 8.0, and digested with trypsin for 16 h at 37 °C.
Phosphopeptides were enriched using an ‘in-house’ IMAC column, then eluted 
with 10% ammonia and 10% acetonitrile and dried in a SpeedVac evaporator. 
Phosphoproteome analysis. Approximately 0.6g cell pellet from wild-type 
(grown in light medium) and slx4Δ (grown in heavy medium) strains was lysed
by bead beating at 4 °C in 4 ml of lysis buffer (50 mM Tris-HCl, pH 7.5, 0.2%
Tergitol, 150 mM NaCl, 5 mM EDTA, Complete, EDTA-free Protease Inhibitor
Cocktail (Roche), 5 mM sodium fluoride and 10 mM β-glycerophosphate).
Protein lysates were denatured in 1% SDS, reduced, alkylated and then precipi- 
tated with three volumes of a solution containing 50% acetone and 50% ethanol. 
Proteins were solubilized in a solution of 2 M urea, 50 mM Tris-HCl, pH 8.0, and 
150mM NaCl, and then TPCK-treated trypsin was added. Digestion was per- 
formed overnight at 37°C, and then trifluoroacetic acid and formic acid were 
added to a final concentration of 0.2%. Peptides were desalted with a Sep-Pak 
C18 column (Waters), dried in a SpeedVac evaporator and resuspended in 1% 
acetic acid. Phosphopeptides were enriched by IMAC as previously described’**, 
reconstituted in 85 µl solution containing 80% acetonitrile and 1% formic acid, and
fractionated by hydrophilic interaction liquid chromatography (HILIC) as prev- 
iously described’*, before analysis by liquid chromatography with tandem mass 
spectrometry (LC-MS/MS). More than 3,570 phosphopeptides were identified or 
quantified, and phosphopeptides containing a phosphorylation site known to be a 
Mec1 target or a target of Rad53-dependent phosphorylation” were selected.
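In the phosphoproteome comparison described above, wild-type cells were grown in light medium and slx4Δ cells in heavy medium, so a heavy/light ratio above 1 for a phosphopeptide indicates higher phosphorylation in slx4Δ cells. The Python sketch below shows one way such ratios might be tabulated on a log2 scale. The peptide labels and intensity values are hypothetical, and this is not the Xpress program used for quantification in this study.

```python
import math


def log2_heavy_over_light(heavy, light):
    """log2 of the heavy/light intensity ratio for one phosphopeptide."""
    if heavy <= 0 or light <= 0:
        raise ValueError("intensities must be positive")
    return math.log2(heavy / light)


def summarize(peptides):
    """Map peptide label -> log2(heavy/light) for (label, heavy, light) tuples."""
    return {label: log2_heavy_over_light(h, l) for label, h, l in peptides}


# Hypothetical intensities (arbitrary units), for illustration only.
example = [
    ("hypothetical_Mec1_target", 1.0e6, 1.0e6),   # unchanged between strains
    ("hypothetical_Rad53_target", 4.0e6, 1.0e6),  # higher in the heavy sample
]
for label, value in summarize(example).items():
    print(f"{label}: log2(H/L) = {value:+.2f}")
```

Working on a log2 scale makes increases and decreases symmetric around zero, which simplifies the comparison of Mec1 targets with Rad53-dependent sites.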
Mass spectrometry analysis. IMAC elutions or HILIC fractions were dried in a 
SpeedVac evaporator, reconstituted in 0.1% trifluoroacetic acid and analysed by 
LC-MS/MS using a 125 µm ID capillary C18 column and an Orbitrap XL mass
spectrometer coupled with an Eksigent nanoflow system. Database searching was 
performed using the SORCERER system (Sage-N Research) running the program 
SEQUEST. After searching a target-decoy budding yeast database, results were 
filtered either based on probability score to achieve a 1% false positive rate or 
manual inspection. Quantification of heavy/light peptide isotope ratios was per- 
formed using the Xpress program as previously described”’. 
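The target-decoy filtering step mentioned above can be illustrated with a short Python sketch. It assumes each peptide-spectrum match carries a score (higher is better) and a flag marking decoy hits, and it estimates the false positive rate among accepted matches as decoys divided by targets. The function name and data layout are hypothetical, and this is not the SORCERER or SEQUEST implementation.

```python
def score_cutoff(matches, max_rate=0.01):
    """Return the lowest score cutoff whose decoy/target rate is <= max_rate.

    `matches` is an iterable of (score, is_decoy) pairs. Returns None if no
    cutoff satisfies the requested rate.
    """
    ranked = sorted(matches, key=lambda item: item[0], reverse=True)
    targets = decoys = 0
    cutoff = None
    for score, is_decoy in ranked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        # Rate among all matches scoring at least `score`.
        if targets and decoys / targets <= max_rate:
            cutoff = score
    return cutoff


# Hypothetical scores, for illustration only.
hits = [(10, False), (9, False), (8, True), (7, False), (6, True), (5, True)]
print("cutoff at 50% rate:", score_cutoff(hits, max_rate=0.5))
print("cutoff at 1% rate:", score_cutoff(hits, max_rate=0.01))
```

Matches scoring at or above the returned cutoff would be retained. Matches accepted by manual inspection, as also mentioned above, fall outside this sketch.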

Chromatin immunoprecipitation (ChIP). Cultures were grown in YPD to an 
optical density at 600 nm ~ 0.3, arrested in G1 with α-factor for 2 h and released in
the presence of 200 mM hydroxyurea (HU) for 1h. Cultures were formaldehyde- 
crosslinked (1% final concentration) for 20 min followed by quenching with 125 mM 
glycine. Protein-DNA complexes were immunoprecipitated with a yeast-specific 
anti-phospho-histone (H2ApS129) antibody (07-745, Millipore). Quantitative PCR
was performed with the purified DNA from the immunoprecipitate as previously 
described” using pairs of primers designed to amplify the genome sequences shown 
in Fig. 4c. The primer sequences were as follows: 39.3Kb_for, 5'-CAAGTGG 
ATTGAGGCCACAGCA-3’, 39.3Kb_rev, 5’-CCGGACAGTACATGAAACT 
GGACA-3'; 43.1Kb_for, 5’-TCAAGGTGGCTTGATGATCGCC-3’, 43.1Kb_rev, 
5'-CACCTCCAATCTGCTTCAAGTTTGGC-3’; 47.3Kb_for, 5’-TATCTTGCG 
GGCCTTTCGTGTC-3’, and 47.3Kb_rev, 5’-GGGAGATTCCATTTCCGCA 
CCA-3’. 
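The ChIP data in Fig. 4c are summarized as mean ± s.e.m. (n = 3) and compared with an unpaired, two-tailed Student's t-test. The Python sketch below shows how those summary statistics and the t statistic could be computed from replicate values. The replicate numbers are hypothetical, the function names are illustrative, and converting the t statistic to a P value would need a t-distribution routine from a statistics package, which is omitted here.

```python
from math import sqrt
from statistics import mean, stdev


def mean_sem(values):
    """Return (mean, standard error of the mean) for replicate values."""
    return mean(values), stdev(values) / sqrt(len(values))


def student_t(group_a, group_b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    n_a, n_b = len(group_a), len(group_b)
    dof = n_a + n_b - 2
    pooled_var = ((n_a - 1) * stdev(group_a) ** 2
                  + (n_b - 1) * stdev(group_b) ** 2) / dof
    t_stat = (mean(group_a) - mean(group_b)) / sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return t_stat, dof


# Hypothetical ChIP enrichment values (n = 3 per strain), for illustration only.
strain_a = [1.0, 2.0, 3.0]
strain_b = [4.0, 5.0, 6.0]
print("strain A mean, s.e.m.:", mean_sem(strain_a))
print("t statistic, d.f.:", student_t(strain_a, strain_b))
```

With three replicates per strain the test has four degrees of freedom, so only fairly large differences reach significance.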
Leadership hurdles


Asian researchers and engineers are too rarely made US 
science leaders, say Lilian Gomory Wu and Wei Jing. 


Many people believe that Asians excel
in science, technology, engineering
and maths (STEM) occupations in
the United States. And indeed, there are lots of
people of Asian descent on the country’s university
campuses and in its STEM workplaces
and federal laboratories. In 2009, Asians (defined
as people from the Far East, southeast
Asia and the Indian subcontinent) made up
78% of doctoral recipients with temporary
visas who were planning to work in the United
States’. One could expect, then, that Asian men
and women would be represented in similar
proportions at the highest levels of STEM
industry, academia and the federal workforce.
But the data tell a different story.

Across all sectors, Asians in US STEM careers 
are not reaching leadership positions at the same 
rate as white people, or even as members of 
other underrepresented groups’. In academia, 
just 42% of Asian men are tenured, compared 
with 58% of white men, 49% of black men and 
50% of Hispanic men. Just 21% of Asian women 
in academia are tenured, the lowest proportion 
for any ethnicity or gender. They are also least 
likely to be promoted to full professor. 

The industrial and federal workforces reflect 
similar numbers. Asian men are doing better 
than Asian women in reaching managerial 
positions in industry, but their numbers are 
lower than those for men of other races and 
ethnicities. Just 4% of Asian women in indus- 
try and 28% in the federal workforce hold 
managerial positions, again the smallest per- 
centage for any ethnicity or gender. 

Asians are almost absent at the very top of US 
companies. The company Leadership Educa- 
tion for Asian Pacifics, based in Los Angeles, 
California, reported’ in 2010 that there were just 
ten Asians or Pacific Islanders among the chairs, 
presidents and chief executives of the 500 big- 
gest US firms; only three of them were women. 

Why the disparity? It may be down to cul- 
tural behaviours, and Western interpretation of 
these behaviours. Asians are often stereotyped 
as a ‘model minority’: hardworking and patient, 
family oriented, good at maths and science and 
having a strong work ethic, but also humble, 
non-confrontational and lacking the passion 
to be charismatic leaders. Worse yet, a work 
group of the US government’s Equal Employ- 
ment Opportunity Commission reports’ that 
Asians are often perceived as ‘forever foreign’,
which can affect how others assess their ability 
to communicate, their competence and, more 
importantly, their trustworthiness. 

Good leadership has a cultural dimension. 
In east Asia, for example, effective leadership 
is measured by what managers do rather than 
by what they say, no matter how passionately 
they speak. A manager in charge of bringing 
out a product there would work day and night 
to get it out on time and free of defects. Com- 
munication skills are generally less important 
in this model. The idea in the United States that 
east Asians lack passion and opinions comes 
from cultural perceptions of their behaviour: in 
discussions, east Asians tend to respond slowly, 
taking time to listen to what is being said
and thus giving the appearance to Americans
that they are not engaged, are passive
and have no opinion. These differences can 
easily lead to unintended biases. 

The problem may go beyond verbal com- 
munication. Grant applications to the US 
National Science Foundation from Asian 
principal investigators between 2004 and 
2011 have been consistently funded in 
lower proportions than those from black, 
Hispanic and white principal investigators’, 
which suggests that differences in writing 
styles may lead to biases. For example, east 
Asians’ humble demeanour could cause 
them to describe the implications of their 
research in modest terms, which might 
bring them lower ratings from reviewers. 

The idea of what makes a good leader in 
the United States needs to be re-examined. 
Cultural differences in communication 
style need further study; peer-review pan- 
els, managers and others should be trained 
to avoid biases. One model is the Strate- 
gies and Tactics for Recruiting to Improve 
Diversity and Excellence programme at the 
University of Michigan in Ann Arbor. Such 
programmes help scientists and engineers 
to be more effective in global collaborations 
and careers. At the same time, Asians need 
to recognize that hard work is not enough; 
they should seek training in communica- 
tion, assertiveness and leadership skills. 

The inequalities that mark the career 
arcs of Asian scientists and engineers in 
the United States are not widely discussed; 
the science community needs to bring 
greater attention to the data. We also need 
to look at whether Asians are recognized for 
their achievements, and whether they are 
receiving awards and becoming members 
of the US National Academies in numbers 
roughly equivalent to the proportion of 
Asians who rise to the level of full professor. 

Diversity is said to be a strength of the 
United States. If cultural differences are 
recognized and respected, the country’s 
scientific enterprise is sure to benefit.
TURNING POINT


Sarah Blackford 


Science-careers adviser Sarah Blackford, head 
of education and public affairs at the Society 
for Experimental Biology in Lancaster, UK, 
assumed that she would be a research scientist. 
But after she landed a contract-research post, 
she realized that her interests lay elsewhere, and 
she manoeuvred through a series of jobs from 
journal publishing to careers development. In 
October, Blackford published her first book, 
Career Planning for Research Bioscientists 
(Wiley-Blackwell). She is on the steering 
committee for the Naturejobs Career Expo. 


What did you hate about research? 
I used to find it really tough doing the experi- 
ments. I am just not a very practical, technical 
person, and don’t follow protocol very well — I 
can’t go by a recipe in the kitchen.


But were there aspects that you enjoyed? 
Presenting results in papers and posters, and 
going to conferences. I also liked interacting
with people — negotiating for equipment, for 
example. When my contract came to an end, 
I thought about scientific publishing. I would 
have a foot in science, but would not be doing 
lab work. Everyone breathed a sigh of relief. 


How did you transfer to careers advising? 

I was the assistant editor at the Journal of 
Experimental Botany, based at the University 
of Southampton, UK, and biologists there kept 
bringing me their CVs — because I worked 
in publishing, they thought I would be a guru 
on language and writing. I enjoyed helping 
them, so I started volunteering at the univer- 
sity’s career-services centre. I helped with CV 
workshops and sat in on interviews. 


How did you move into paid careers advising? 
My job relocated to Lancaster University when 
the journal editor changed, so I went to volun- 
teer with their career services — and this was 
the turning point for my life. They were looking 
for someone to cover for a person on sabbatical, 
and I got the job. It was for only three months, 
but I knew I had to take it — and as it turned 
out, the job lasted for two years. After that, I had
enough experience to get a job at the University 
of Leeds, UK, for a year and a half, where I wrote 
marketing plans, organized conferences, liaised 
with employers, ran careers workshops. 


What prompted you to write a book? 

I missed working with scientists, and a job came 
up with the Society for Experimental Biology 
involving career development, science com- 
munication and education. I have been in the 
post since 1998. A few years ago, while running
careers workshops at a conference in Finland, 
I was chatting with a marketing manager and 
said flippantly that one day I would put all this 
information into a book. When I got back to my 
office, I had an e-mail from the commissioning 
editor at a publishing house saying he under- 
stood that I was thinking about writing a book. 


Why did you focus on bioscience careers? 

All this valuable careers information is being 
directed to people at conferences, but there was 
almost nothing in writing for bioscientists. 


Do you have advice for biomedicine postdocs? 
They need to keep learning new techniques and 
skills. They need to campaign for better con- 
tracts, the right to develop management skills, 
the opportunity to teach or do whatever they 
want to do to improve their career prospects. 
They can’t let their supervisor steer for them.
Universities are employing fewer technicians 
now, and postdocs are in danger of becoming 
supertechs. They also need to decide whether 
taking a third postdoc is an advantage. It may 
be convenient, but they ought to ensure that it 
will build on their current capabilities so that 
they are improving their career prospects. 


What caveats do you find yourself repeating
to early-career biomedical researchers?

You have to sell yourself. One of the easiest ways 
is through social media and networks. You 
need to network, because it is other people who 
get you jobs. Postdocs especially aren't using 
social media and networks enough: LinkedIn, 
for example, is extremely valuable because a 
lot of recruiters use it. You can meet influential 
people online — modern networks are very 
democratic. Opportunities are out there.


INTERVIEW BY KAREN KAPLAN 


SCIENCE FICTION


FIRST FOOT


Everything as it should be.


BY DEBORAH WALKER


New Year’s Eve and it’s snowing out-
side. Of course it is. They switched
on the weather machines on
1 December. Snow for the holidays. We're
in the living room, waiting for the first foot.
The first person through the door who will
bring our luck for the year.

Uncle Milo’s dozing in his chair.
He’s as drunk as a skunk. He was
never a heavy drinker. But we all
have our quotas.

Uncle Milo and Auntie Val
live with us. They used to have a
nice retirement flat overlooking
the harbour. But families should
live together.

The lights are dim, the red
fairy lights on the Christmas
tree are winking and record-
ing, checking that the traditions
are being upheld. I wonder if any
observers are tuned in. Are we pro-
viding good entertainment?

I go to the window, stare out into
the street. There’s a hundred families,
a hundred houses in a circle. A hundred
houses with no back doors or windows.
There are lights in every sitting room. We're
all waiting for our first foot. I can see them,
tall dark figures in the cold, waiting to bring
our luck.

With a peal of bells from a non-existent
church, the year turns.

“Happy New Year,” shouts Mother. When
the twins jump up and down, shouting in
excitement, Mother looks relieved. She’d
spent all afternoon coaching them. I smile
at her, trying to tell her, that I understand.
I really do.

She pats my cheek. “Happy New Year,
Brenna.”

Auntie Val rouses Uncle Milo. We link
hands and sing Auld Lang Syne.

Mother looks towards the door. “Where is
he? Where is he?” she whispers.

“It’s all nonsense.” Uncle Milo’s eyes
are red and bleary. He strides towards
the Christmas tree and glares at it. “It’s all
nonsense.” His words fall like the heavy
snow.

I pull the twins closer to me. They're
still bewildered, after seeing Santa deliv-
ering the presents. Santa shouldn't ooze.
Sometimes the enforcers get things wrong.
But it’s no good complaining. 

“Is it going to be all right?” asks Corey.
Jane buries her face into my shoulder. No, 
I feel like saying. It’s not going to be all right.
How can it be? But it’s not fair to them. I 
smile and tell them that it’s going to be fine. 
They should be allowed to hope. They’re 
only kiddies. 

Auntie Val is trying to calm Uncle Milo. 
“Hush now,” she says, laying a hand on his
shoulder. Auntie Val looks so frail. Since the 
abduction, she’s grown smaller and smaller, 
despite all the food she’s had to eat during 
the season's celebrations. I’m worried about 
Auntie. I'm worried about us all. I’m all worn 
out with the worry. Worry is a knife, and it’s 
whittled me hollow. 

“I won’t be hushed.” Uncle Milo’s voice
grows louder, accumulating like a rolling 
snowball. “We. Can't. Live. Like. This. You 
can’t force us to enjoy ourselves.” He takes a
swing at the tree. The red fairy lights flash. 
Uncle stumbles. Mother gives a little gasp. 
We both reach out to try to catch him. But 
we're too slow. He falls to the floor, landing 
badly on the wooden abattoir that Santa 
bought Corey for Christmas. 
“I’m okay. I’m okay,” he says. “Don't fuss
so. Don't,” he says, shaking off Auntie Val’s
offer of help. He sounds as if he hates her. 

Auntie Val begins to cry. So do the twins. 

“Unseasonal behaviour won’t be toler-
ated.” The enforcer’s voice fills the room.

Mother says quickly, “But it’s traditional for 
people to lose their rag at this time of year.” 
“Ensure that it’s an isolated incident.” 
“I will, thank you.”

There's a knock at the door. 

“Answer it, Brenna,” says 
Mother. 

I open the door. He’s tall 
and he’s dark. Father smiles 
at me, but he looks puzzled. 

I feel uneasy. Usually Father's
so good at playing his part.

“Where were you?” hisses
Mother. “Uncle Milo’s had a
turn.”

Uncle’s still on the floor, 
sobbing. 

“I... met someone.” 

“Just get it over with,” says 
Mother. 

Father holds a handful of silver
coins, a lump of coal and a twist of salt.
He shouts out: “Happy New Year t’ye! God 
send ye plenty! Where ye have one pound 
note, I wish ye have twenty.” He passes out 
the gifts, giving me the salt twist. 

I unwrap it and taste the salt with the tip 
of my finger. Then I let the salt fall to the 
floor, and quickly shove the wrapper into 
my pocket. The paper’s printed with a red 
circle: the sign of the resistance. I'd heard 
rumours but I never thought they could be 
true. Can anyone fight the enforcers? Can 
anyone escape this zoo?

Mother gives Father a mince pie. It’s 
dusted with blue sugar. Father looks at it for 
a moment, before eating it with two quick 
bites. 

Mother sighs. “That’s it, then. Let's get to 
bed.”

“Happy New Year, Mother.” I hug her.

“Happy New Year, darling.” 

I help the twins to bed, thinking about the
first foot who brings the luck for the New 
Year. And I’m thinking about the resistance. 
The New Year brings hope.


Deborah Walker grew up in the most 
English town in the country, but she soon 
high-tailed it down to London, where she 
now lives with her partner, Chris, and her 
two young children. 


JACEY 


